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Access all areas 


To create a sustainable, open research literature, governments need to find the finances to make it 
viable — and recognize that adding value to diversifying research outputs has its own costs. 


towards a truly open research literature. But the underlying 
questions have hardly been touched on: who needs whom to 
add what value to what literature, and who is willing to pay for it? 

Consider first a ridiculous distraction: the US Research Works Act. 
Proposed late last year in the House of Representatives, this seeks to 
stop US federal funders from mandating that research papers be made 
freely available. The current policy of the US National Institutes of 
Health — a policy supported and assisted by Nature and its publishers 
— is that authors’ final versions of papers should be deposited in the 
freely accessible PubMed Central database within a year of publica- 
tion. That policy would be prohibited by the Research Works Act. The 
proposal has provoked an outcry, stimulating several publishers to state 
their opposition (including, last week, ours; see go.nature.com/myil4g). 

Why is this a ridiculous distraction? Because it tries to reverse a slow 
but strong political tide that is in favour of access, and because even its 
supporters believe that it has no chance of passing. 

Next, consider the more serious and necessary debate about online 
theft of copyrighted content. In 2002, Nature’s publishers resolved that 
the authors of original research papers should retain copyright while 
giving our journals an exclusive licence to publish, but there remain jus- 
tified concerns in science publishing and more generally that unlicensed 
online distribution threatens the viability of producers of valued content. 

Two draft measures introduced to Congress last year — the 
PROTECT Intellectual Property Act and the Stop Online Piracy Act 
— intended to counteract such threats, but triggered huge opposition 
because of the threat of collateral damage to Internet activity. This 
climaxed last week with a day-long blackout of Wikipedia and hostile 
statements by several publishers including Nature’s (see go.nature. 
com/kttyax). Both proposals have been put on hold, and the piracy 
act will be redrafted. 

Despite these skirmishes, the vision of an open research literature 
has both scientific merit and strong international political support. But 
there are still substantive issues regarding the future of the primary 
research literature, which are unlikely to be resolved for years. 


r The past week has seen several twists and turns along the road 


ADDING VALUE 

No one disagrees that a publisher of review articles deserves to charge 
for access to them. After all, the publisher's staff have contributed value 
in various ways: identifying the author and the article's aim, assessing 
and editing the draft, selecting peer reviewers, working with the author 
to build on their advice, developing illustrations, rendering the article 
into print and online forms, maintaining it online and including links, 
citation statistics and other enhancements. 

A publisher of research papers also does all of these things, except 
that authors voluntarily submit the article, the editors undertake 
careful assessment of scientific significance, and the refereeing stage 
involves much deliberation, occasional debate and revisions that 


significantly enhance the robustness and scientific impact of the paper. 
The publishers and editors of Nature and its related journals remain 
committed to sustaining and developing these components of added 
value. For example, we organize researcher meetings to review our 
acceptance and technical standards in fast-moving fields, we upgrade 
the online search and other facilities surrounding our papers, and we 
are investing in improved presentation of online data and illustrations. 
As for the business model, anyone who wishes to preserve these modes 
of added value should favour a publishing 


“The vision of an — eyolution with a mixed economy of author- 
open research pays open access and subscriptions, with 
literature has some journals, such as Nature Communica- 
both scientific tions, allowing both options (see go.nature. 
merit and com/slwrt3). 

strong political But the literature itself is changing. It no 
support.” longer consists of only static papers that 


document a research insight. In the future, 
online research literature will, in an ideal world at least, be a seamless 
amalgam of papers linked to relevant data, stand-alone data and soft- 
ware, grey literature’ (policy or application reports outside scientific 
journals) and tools for visualization, analysis, sharing, annotation and 
providing credit. 

And ‘publishers’ will increasingly include organizations or individuals 
who are not established journal publishers, but who host and provide 
access and other added value to this online edifice. Some may be 
research funders, such as the National Institutes of Health in its hosting 
of various databases; some may be research institutions, such as the 
European Bioinformatics Institute. Others may be private companies, 
including suppliers of tools such as the reference manager Mendeley and 
Digital Science, sister company to Nature Publishing Group. 

This literature will need to be readable and computable not only by 
people but also by machines, which will, in turn, require publishers 
to develop new standards. 

In short, the literature is becoming ever more multifaceted, and inter- 
mediaries will be needed to supply added value and usability. It is hard 
to imagine such a primary literature and all of those seeking to add 
genuine value to it thriving when its key results are behind subscription 
firewalls. But a vision for open access in which all results — text, data, 
grey literature and so on — are immediately available in their published 
versions requires the costs of that added value to be paid for. 

None of this will occur until the tide in its favour becomes unstop- 
pable. The only way that can happen is for governments to recognize 
the complexities of this terrain, and the damage that can be done to 
the providers of added value and to research itself as a result of poorly 
considered prohibitions or compulsions. Above all, they need to find 
the money to make the vision viable. Only then will the open research 
literature truly come to fruition, and only then will those wishing to 
provide added value be able to invest confidently in doing so. m 
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Fertile union 


Scientists and politicians are working together 
to bring new reproductive techniques to Britain. 


‘science-based’ policy-making is anything but. But sometimes, 
they get it right, and when they do, they deserve praise. 

One such case is last week’s announcement that Britain intends 
to develop in vitro fertilization (IVF) techniques that could cut the 
number of children born with devastating genetic conditions such as 
muscular dystrophy. Not only are scientists, lawmakers and ethicists 
speaking the same language, they are also synchronizing their efforts to 
make Britain the first country to test the techniques in humans, taking 
it light years ahead of other nations (see page 419). 

Uniquely among IVF procedures, the new techniques involve 
embryos that combine genetic material from three people. The pros- 
pect ofa child with three genetic parents has inevitably raised concern 
among some commentators, and among politicians worried about what 
those commentators will say. 

The diseases targeted by these techniques are passed to children by 
mothers through faulty mitochondria in their eggs. The techniques 
therefore remove the genetic material from the nucleus of the faulty egg 
and insert it into a healthy egg that has been stripped of its own nucleus. 
This is where the third ‘parent’ comes in: a different woman must sup- 
ply the healthy egg. A child born from IVF using one of the modified 
eggs will therefore carry the genes of three people: nuclear DNA from 
both parents and mitochondrial DNA from the donor. 

Concerns over the science, safety and ethics of the procedures 
prompted the British government to ban them in 2008, in an amend- 
ment to the Human Fertilisation and Embryology Act, which governs 
fertility treatments and research. But the law also anticipated scientific 
advances in these technologies and set out a streamlined mechanism 
to legalize the procedures if a future government saw fit. Since then, 
one of the techniques has been used to produce two healthy rhesus 
monkeys and the other has been tested successfully in defective human 
eggs fertilized in vitro. 

Britain has now decided that the time is right to revisit its legislative 


r | 100 often, scientists and policy-makers talk past one another and 


ban. A scientific review released in April 2011 by the Human Fertilisa- 
tion and Embryology Authority (HFEA) — the agency that would regu- 
late the procedures — found no scientific evidence that the techniques 
are unsafe, but did recommend additional studies that would be needed 
before clinical trials could begin. Many of those studies are now under 
way or soon will be. The Wellcome Trust, Britain’s biggest biomedical 

charity, announced last week that it would pay for some of them. 
Legislators and regulators are not standing idle while the scientists 
work away. Simultaneously, and in anticipation of positive scientific 
findings, the HFEA announced a public consultation on the tech- 
niques. The consultation is the first step towards legalizing clinical 
trials. Under the 2008 change to the fertility 


“Elected act, the government can write an amendment 
offi icialsdoseem to legalize the procedures that would require 
to be giving only brief discussion before being voted on 
the issue a fair by Parliament. 

and informed Any number of issues could yet derail the 
hearing. “a trials. Additional research could raise safety 


problems or find the process too inefficient. 
The public, perhaps swayed by ‘three-parent baby’ headlines, could 
resist. And the Nuffield Council on Bioethics in London, which has 
just started an independent review of the techniques, could iden- 
tify more ethics qualms. Allowing changes to the mitochondria of 
an embryo, for instance, could increase the chances that changes to 
nuclear DNA will be allowed in order to treat other conditions, such as 
cystic fibrosis. In exploring that path, we eventually come to designer 
babies, with non-medical tinkering to develop certain traits. 

Critics may say that the UK government could have addressed these 
issues more quickly by legalizing the procedures and then leaving reg- 
ulators to decide when the science was sufficient. But elected officials 
do seem to be giving the issue a fair and informed hearing, and are 
placing science at its centre. 

The situation is very different in other countries. A government 
review of embryo research in Australia last year recommended against 
changing the law to legalize clinical use of the procedures. And in 
the United States, regulators consider the techniques to be a form of 
gene therapy, controlled by the Food and Drug Administration, which 
seems in no hurry to consider their clinical use. 

About one in 200 women passes a mitochondrial disease on to her 
children. In Britain, those children have a chance of a better future 
because scientists and politicians, for once, are seeing eye to eye. m 


Notes on screen 


Computer tablets are changing the way that 
scientists record their experiments. 


Laboratory at the University of Cambridge, UK, says that lab notes 

“need not be particularly tidy, but they should be understandable 
by the writer or somebody else at a later date”. Written in 2008, the 
guidance adds: “Your notebook must be A4 in size and hard-bound. 
A suitable book can be bought from the laboratory technician.” 

So far, no doubt, so familiar — but technology is marching on, and 
commuters are starting to abandon dog-eared paperbacks for e-books. 
For how much longer will the lab book prevail in its current form? 
And how many more notebooks will the Cavendish technicians sell? 

Reports of the death of the standard lab book — in use for hundreds 
of years — are, of course, as premature as they are exaggerated. And 
Nature has been here before — in a feature in 2005, we reported that 
electronic notebooks were poised to become increasingly popular 
among researchers (see Nature 436, 20-21; 2005). The News Feature 


E its introductory handbook for physics students, the Cavendish 
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on page 430 of this issue, which takes a look at the rise of the digital 
lab, shows that we were right. 

“Paper has nothing to offer me,” says Michelle James, an Alzhei- 
mer’s disease researcher at Stanford University in California who is 
profiled in the feature, and who has moved her scientific notes to her 
iPad. James is far from alone — a generation of bench scientists is 
ditching paper and taking advantage of computer tablets and software 
that allow people to share protocols and swap notes. (If it cheers the 
old guard, who even now are vowing never to abandon their trusty 
notebooks and pencils, the digital-savvy researchers must place their 
fancy kit in plastic bags to protect it from spills.) 

There is more to this than the migration of content from print to 
web. Just as newspapers have been able to exploit the Internet to reach 
readers and build communities in ways that they could not have imag- 
ined when they first started placing their copy online, so powerful 
processors and the digitization of data could let researchers analyse 
their results much earlier in the scientific process than is common now. 

Such an approach is not completely new, but digitization makes 
it easier. The Cavendish introductory notes 
say: “Ideally you should plot graphs as you go 
along, not after completing the experiment, 
though in practice this is not always possible.” 
It is now. = 
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RESEARCH HIGHLIGHTS 


Culprits in 
diabetic heart risk 


People with type 1 diabetes 
have an above-average risk 
of developing heart disease, 
or atherosclerosis, and two 
related groups of inflammatory 
immune cells may be to blame. 
Immune cells called 
monocytes and macrophages 
— which develop from bone 
marrow — from mice and 
humans with diabetes express 
unusually high levels of the 
enzyme ACSL1. This modifies 
fat molecules that then trigger 
inflammation, according 
to Karin Bornfeldt at the 
University of Washington in 
Seattle and her team. Diabetic 
mice that received transplanted 
bone-marrow cells lacking 
ACSLI developed fewer and 
smaller blood-vessel lesions 
characteristic of atherosclerosis 
than did diabetic mice 
transplanted with normal cells. 
Blocking this enzyme 
could prevent the accelerated 
atherosclerosis common in 
patients with diabetes, the 
authors suggest. 
Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/ 
pnas.1111600109 (2012) 


MATERIALS 


Circuit, heal 
thyself 


A crack in an electrical 
circuit no longer needs to 
be permanent, thanks to the 


Selections from the 
scientific literature 


ANIMAL BEHAVIOUR 


Snakes strangle with feeling 


As boa constrictors tighten their bodies around 
their prey, the snakes sense the dying animal's 
heart rate to determine how much pressure to 
apply and when to stop squeezing. 

Scott Boback and his colleagues at 
Dickinson College in Carlisle, Pennsylvania, 
implanted simulated hearts into warm, dead 
rats and presented them to 16 snakes. In the 
presence of a heartbeat, the boas constricted 
for nearly twice as long and with more than 


development of circuits that 
can ‘heal’ themselves. The 
incorporation of tiny capsules 
of liquid metal allows electrical 
circuits to recover almost all 
of their function after being 
broken. 

Nancy Sottos, Scott White 
and their colleagues at the 
University of Illinois at 
Urbana-Champaign created 
capsules of gallium and 
indium (pictured) and placed 
them either directly onto gold 
circuits or into an insulating 
layer above the wires. Ifa crack 
breaks the circuit and ruptures 
the capsules, the liquid metal is 
released, filling in the crack. 
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their prey. 


(2012) 


After damage, the 
researchers’ circuit recovered 
99% of its conductance in less 
than 1 millisecond. However, 
the team’s method may not be 
immediately applicable because 
the demonstration circuit is 
significantly larger than most 
current micro-electronics. 

Adv. Mater. 24, 398-401 (2012) 


} CANCER 
Tumour cells 
lend a hand 


Cells that provide tumours 
with support may also defend 
against the cancer’s spread, 


double the pressure than in the absence ofa 
heartbeat. When the heart stopped midway 
through constriction, the snakes released 


The researchers speculate that the ability to 
detect a heartbeat co-evolved with constriction 
(pictured) to help snakes identify when death 
occurs and minimize energy expenditure. 

Biol. Lett. http://dx.doi.org/10.1098/rsbl.2011.1105 


so chemotherapies that target 
these cells could inadvertently 
fuel metastasis. 

Pericytes are cells that 
provide structural support 
to blood vessels, including 
those that feed tumours. 
Raghu Kalluri at Harvard 
Medical School in Boston, 
Massachusetts, and his group 
found that patients with 
invasive breast cancer tended 
to fare poorly if they had few 
pericytes covering the blood 
vessels in their tumours. 
Selectively killing pericytes 
in tumour-bearing mice 
slowed tumour growth but 
increased metastasis and 


S. BOBACK 


the expression of a cancer- 
promoting gene called Met. 
Two chemotherapies that 
target pericytes — imatinib 
and sunitinib — produced 
similar effects in mice, but 
their effects on metastasis 
were suppressed when a 
Met inhibitor was given 
simultaneously. The authors 
suggest that pericyte loss 
makes tumour blood vessels 
leaky, which may set in motion 
events that promote metastasis. 
Cancer Cell 21, 66-81 (2012) 


PHYSICS 


One molecule, 
one photon 


To harness quantum 
technology for applications 


— is shuttled through several 
organelles in the infected cell 
and eventually shuts down 
cellular protein production. 
Somshuvra Mukhopadhyay 
and Adam Linstedt at 
Carnegie Mellon University 
in Pittsburgh, Pennsylvania, 
fluorescently tagged the 

toxin and found that, in cells 
treated with manganese, the 
toxin was rerouted to the cell's 
degradative compartment and 
destroyed. Manganese targets 
a protein called GPP130 — 
which is normally required 
for the trafficking of the toxin 
— preventing the toxin from 
binding to it. 

Mice injected with the toxin 
and treated with manganese 
stayed healthy throughout 
the six-day study, whereas 
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Warming, but not as much 


3 HIGHLY READ 


The climate system may be less sensitive 
to greenhouse-gas warming than many 
models have predicted. 

Nathan Gillett and his co-workers at 
Environment Canada in Victoria, British 


Columbia, analysed how well the latest Canadian Earth System 
Model tracked temperature changes attributable to volcanoes, 
man-made aerosols and rising greenhouse-gas emissions. 
They adjusted the model using temperature records from 1851 
to 2010 — 60 years of data more than most previous analyses. 
The model predicted a short-term increase of 1.3-1.8°C fora 
doubling of atmospheric carbon dioxide levels, which is low in 
the range of estimates from previous forecasts. 

Under various scenarios, the authors’ model forecasts 


T. CHILD 


untreated mice died within 
cryptography, researchers must four days. with pre-industrial times. 

develop materials that can emit Science 335, 332-335 (2012) Geophys. Res. Lett. http://dx.doi.org/10.1029/2011GL050226 
single photons in response to (2012) 


electrical signals. However, 

existing approaches require an 

ultracold environment. Ready fo r th e 
toxic toads 


Maximilian Nothaft at 
the University of Stuttgart in 

Exposure to an exotic plant 
seems to have pre-adapted a 


Germany and his colleagues 
native Australian lizard to the 


such as computing and warming of 1.2-4.3°C by the end of the century compared 


various regions of Australia 
to either the toad or plant 
toxin. They found that skinks 
from areas where the plant 

is abundant were better able 
to tolerate both. Moreover, 
skinks readily eat both the 
plants and the toads. The 
findings suggest that the plants 
have selected for skinks that 
could tolerate the toad toxins. 
Am. Nat. http://dx.doi. 
org/10.1086/664184 (2012) 


Mutation and 


Federal Institute of Technology 
in Zurich and his colleagues 
activated the immune system 
of pregnant mice that had only 
one copy of the Nurr1 gene. 
They found that 
dopaminergic development 
in key parts of the brain was 
impaired in Nurr1-deficient 
pups born to immune- 
activated mothers. As adults, 
these mice were unable to 
sustain or shift attention 
normally, a characteristic of 
neurodevelopmental diseases 
such as schizophrenia and 


tested phosphorescent iridium- 
based organic molecules as 
the active layer in a light- 
emitting diode (LED) at room 
temperature. They showed 
that the molecules could 
emit photons in response to 
electrical and laser stimulation. 
The distribution of the emitted 
photons suggests that each was 
emitted bya single molecule. 
The authors say that their 


toxins of an invasive animal. 


method should allow for attention-deficit hyperactivity 
further study of photon- infection to blame disorder. 
emission mechanisms in J. Neurosci. 32, 436-451 (2012) 
organic LEDs. The American cane toad A combination of genetic and 
Nature Commun. http://dx.doi. (Rhinella marina), an exotic environmental factors may act 
org/10.1038/ncomms1637 introduced to Australia within synergistically to boost the risk CORRECTION 
(2012) the past century, is generally of certain neurodevelopmental In the Research Highlight 

toxic to native predators. disorders. ‘Magnetic switch for 
But Richard Shine and his In humans, mutations in memory’ (Nature 481, 

. colleagues at the University NURRI —a gene vital for 241; 2012), the authors 

Manganese fights of Sydney observed that the normal development switched the magnetic states 
deadl y toxin bluetongue skinks (Tiliqua of neurons that produce of antiferromagnetic iron 

scincoides; pictured) in some a neurotransmitter called atoms by applying a voltage, 
A toxin produced by certain areas can tolerate the toad’s dopamine — are thought not a current, of more than 
deadly strains of pathogenic toxins. The toxins are similar to slightly raise the risk of 7 millivolts. 
bacteria can be stopped in to those produced by another neurodevelopmental disorders. 
its tracks by the element exotic species, the Madagascan The risk is similarly increased 
manganese. ‘mother of millions’ plant by infection in the mother > NATURE.COM 

Shiga toxin — generated (Bryophyllum spp.). during pregnancy. To test the For the latest research published by 

by Shigella bacteria and some The authors tested the combined effect of these two Nature visit: 


strains of Escherichia coli resistance of skinks from factors, Urs Meyer at the Swiss www.nature.com/latestresearch 
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SEVEN DAYS neni 


Leap-second vote 


A decision about whether 

to ditch the leap second 
—amove that would pull 
reference time out of sync 
with the rising and setting of 
the Sun — has been deferred 
until 2015. Delegates to the 
World Radiocommunication 
Assembly of the International 
Telecommunication Union 
in Geneva, Switzerland, were 
supposed to vote on the issue 
last week, but couldn't reach a 
consensus. See go.nature.com/ 
tucaxg for more. 


Untreatable TB? 


India’s government has 
dismissed reports that a form 
of incurable tuberculosis 

has arrived in the country. 
Last year, researchers at the 
Hinduja National Hospital 

in Mumbai reported 12 cases 
of totally drug-resistant 
tuberculosis (Z. EF Udwadia 
et al. Clin. Infect. Dis. http:// 
doi.org/hpd; 2011). This 
would have made India the 
third country, after Italy and 
Iran, in which an untreatable 
form of the disease had 
emerged. But in a 17 January 
statement, the health ministry 
said doctors had found that 
seven of the patients were 
responding to treatment, and 
that the disease was in fact 
extensively-drug-resistant 
tuberculosis. See go.nature. 
com/hygn2o for more. 


West Bank rebuke 


The European Commission 
and London's venerable 
Natural History Museum 

are facing criticism for 
working with a company that 
has facilities inside Israeli 
settlements in the West 

Bank occupied territories — 
settlements that international 
law has determined are 
illegal. The company, Israeli 
cosmetics firm Ahava Dead 
Sea Laboratories, is part of 


Tar sands pipeline pulled 


US President Barack Obama has blamed 
politics for his rejection of an application to 
build a 2,700-kilometre-long pipeline to carry 
oil from tar sands in Alberta, Canada, to the 
coast of Texas. Congress had forced a deadline 
of 21 February for the decision, which did not 


a €5.19-million (US$6.7- 
million) research project on 
the toxicity of nanoparticles. 
On 17 January, 21 academics 
and public figures wrote to 
the British newspaper The 
Independent decrying the 
museum's involvement with 
the firm. See go.nature.com/ 
vxuwwm for more. 


NIH salary cut 


The maximum salary that 
institutions can pay toa 
biomedical researcher out of 
a grant from the US National 
Institutes of Health has been 
cut by 10% to $179,700. The 
agency told grant recipients 
about the tightened salary 
cap (which applies to grants 
won after 22 December) on 
20 January. Academic medical 
centres will have to dig into 
other budgets if they wish 

to keep wages high. The cut 
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leave time for the state department to assess 
the project, Obama said on 18 January. The 
decision pleased environmental campaigners, 
but TransCanada, the Canadian company 
hoping to build the Keystone XL pipeline, said 


that it would reapply. 
could have been worse — public meeting will be held 
last October, Congress was on 13 February. See go.nature. 
considering legislating a 17% com/s6sgw8 for more. 
cut. See go.nature.com/jdk9gh 
for more. Pe BUSINESS 
Genome reshuffle Stem-cell promise 


Genome scientists and 
genetics societies have 
expressed support for a 
planned reorganization of 

the US National Human 
Genome Research Institute. 

At a public teleconference 

on 18 January, biologists 
welcomed the changes, which 
create new divisions to help the 
institute shift the focus of its 
research from understanding 
the structure and biology of 
genomes, to translating this 
information into better medical 
care. (Priorities were laid out 
ina planning paper last year; 
see E. D. Green et al. Nature 
470, 204-213; 2011.) Another 


Two clinical trials testing 
retinal cells derived from 
human embryonic stem 

cells have reported positive 
preliminary results. 
Advanced Cell Technology, 

a biotechnology firm in 
Marlborough, Massachusetts, 
says that the cells seem 

to be safe 4 months after 
being injected into the 

eyes of two patients with 
differing forms of blindness 
(S. D. Schwartz et al. Lancet 
http://dx.doi.org/10.1016/ 
S0140-6736(12)60028-2; 
2012). The trials — which are 
testing safety, not efficacy — 
will eventually enrol a dozen 
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EUROPEAN UNION 
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patients each, with final results 
expected in 2013. See go.nature. 
com/g3mjg7 for more. 


Nuclear looting 


Radioactive material was stolen 
last week from a nuclear power 
plant being built in Egypt, 
highlighting the dangers of 
nuclear looting during social 
upheaval. The International 
Atomic Energy Agency said 
that low-level radioactive 
sources were taken from a lab at 
the El Dabaa nuclear plant on 
Egypt's Mediterranean coast. 
This is not the first example 

of such theft: for instance, 
looters stole caesium and cobalt 
radioactive sources from a 
nuclear centre in Baghdad in 
2003, after the US-led invasion 
of Iraq. See go.nature.com/ 
u6g8se for more. 


PEOPLE 
European funder 


Teresa Riera Madurell 
(pictured), a Spanish 
politician and former 
computer scientist, will be 

the rapporteur for Europe's 
enormous 2014-20 research 
funding programme (Horizon 
2020), Nature has learned. 

As rapporteur, Riera will 
shepherd committee reports 
on the programme through 
the European Parliament, and 
will influence decisions about 
its funds and structure. Under 


Biodiesel firm Renewable 
Energy, based in Ames, Iowa, 
made the first US initial public 
offering (IPO) of 2012, raising 
US$72 million on 19 January. 


Companies hope to improve on 


2011, which saw only 21 clean- 


energy IPOs globally, say analysts 
Bloomberg New Energy Finance. 
The sector’s three largest IPOs last 
year were Chinese firms: Sinovel 
Wind Group ($1.4 billion); wind- 
energy firm Huaneng Renewables 
($850 million); and photovoltaic 
maker Beijing Jingyuntong 
Technology ($394 million). 


proposals currently being 
considered, the programme 
would distribute around 
€80 billion (US$104 billion) 
for research. 


Brazilian minister 


Brazil's new science minister 

is Marco Antonio Raupp, 
formerly head of the 

Brazilian space agency. From 
24 January, he replaces Aloizio 
Mercadante, who moves to the 
education ministry in a cabinet 
reshuffle. The announcement 
was made on 18 January. 


Crafoord winners 
Astrophysicist Reinhard 
Genzel, at the Max Planck 
Institute for Extraterrestrial 
Physics in Garching, Germany, 
and astronomer Andrea Ghez, 
at the University of California, 
Los Angeles (UCLA), have 
won the 2012 Crafoord Prize 
in Astronomy for their work 
showing that there is probably 
a black hole at the centre of the 
Milky Way. The prize, worth 

4 million Swedish kronor 
(US$590,000), rotates through 


THE IPO ROLLERCOASTER 


several disciplines. This year, 
a separate mathematics prize 
was given to mathematicians 
Terence Tao at UCLA and Jean 
Bourgain at the Institute for 
Advanced Study in Princeton, 
New Jersey, for their work 

in areas including harmonic 
analysis, partial differential 
equations and ergodic theory 
of dynamical systems. 


Early-career grants 
The Howard Hughes Medical 
Institute in Chevy Chase, 
Maryland, will dole out a total 
of more than US$20 million 
to 28 international scientists 
who are beginning their 
independent careers. The 
medical-research giant 
announced on 24 January that 
the investigators come from 
12 countries, with China, 
Portugal and Spain leading 
the way. Each person, selected 
from the 760 applicants, will 
receive $650,000 over 5 years. 


} RESEARCH 
A chip off Mars 


A meteorite that fell to Earth 
last July in Morocco has proven 
to be a rare chunk of Mars. 
Only a handful of Martian 
meteorites are known, and 
only five (including the new 
find) were seen falling to Earth 
— an important factor because 
it tells scientists how much 
time the meteorites have had 
to pick up contamination on 
the ground. The last witnessed 


Initial public offerings (IPOs) in clean energy stuttered in 2011 


after a recovery in 2010. 
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To inform its priorities 
in tackling climate 
change, the UK 
government publishes 

a study that weighs up 
climate risks according 
to their financial, social 
and environmental cost. 


fall was in 1962. About a 
dozen pieces of meteorite were 
recovered from Morocco in 
December, and were certified 
on 17 January as coming 

from Mars. See go.nature. 
com/9scfwq for more. 


Bat scourge 

Between 5.7 million and 

6.7 million bats have been 
killed by white-nose syndrome 
in North America since 2006, 
biologists agreed on 17 January, 
after a meeting of the Northeast 
Bat Working Group in Carlisle, 
Pennsylvania. A 2009 estimate 
had put the death toll at 

1 million. The deadly disease, 
caused by a fungus, was first 
documented in February 2006 
ina cave in New York, and 

has spread to at least 16 other 
US states and 4 Canadian 
provinces. Mortality rates in 
surveyed bat colonies range 
from 70% to more than 90%. 


Altering embryos 


Researchers at Newcastle 
University, UK, have received 
£5.8 million (US$9 million) 
from the university and 

the Wellcome Trust in 
London to assess the safety 
of techniques that transfer 
genetic material between 
human eggs to produce 
embryos free of inherited 
mitochondrial diseases. The 
award was announced on 

19 January. On the same day, 
British regulators launched 
a public consultation on the 
procedures, a step towards 
legalizing their clinical use. 
See page 419 for more. 
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Countries with endemic H5N1 avian flu, such as India, often have limited resources to control the virus. 


BIOSAFETY 


Caution urged for 
mutant flu work 


Public-health benefits of controversial research questioned. 


BY DECLAN BUTLER 


hy would scientists deliberately 
create a form of the H5N1 avian 
influenza virus that is probably 


highly transmissible in humans? In the 
growing debate about research that has done 
precisely that’, a key question is whether the 
public-health benefits of the work outweigh 
the risks of a potential pandemic if the virus 
escaped from the lab. 

For the scientists who have created the 


mutated strains of the H5N1 virus, the 
justifications are clear. Surveillance of flu 
viruses could, they argue, allow health organi- 
zations to monitor birds and other animals 
for the mutations that would provide an early 
warning of a pandemic and enable authorities 
to act quickly to contain the virus. 

That claim is meeting with scepticism, how- 
ever. More than a dozen flu experts contacted 
by Nature say they believe that the work opens 
up important vistas in basic research, and that 
it sends a valuable warning about the potential 


for the virus to spark a human pandemic. But 
they caution that virus surveillance systems are 
ill-equipped to detect such mutations arising 
in flu viruses. As such, work on the viruses is 
unlikely to offer significant, immediate public- 
health benefits, they say. 

That tips the balance of risk—benefit assess- 
ment in favour of a cautious approach, says 
Michael Osterholm, who heads the University 
of Minnesota’s Center for Infectious Disease 
Research and Policy in Minneapolis, and who is 
amember of the US National Science Advisory 
Board for Biosecurity (NSABB). 

In a paper submitted to Science, Ron 
Fouchier’s team at Erasmus Medical Center in 
Rotterdam, the Netherlands, found that just 
five mutations allowed avian H5N1 to spread 
easily among ferrets, which are a good proxy for 
how flu behaves in other mammals, including 
humans. All five mutations have been spotted 
individually — although not together — in 
wild viruses. Yoshihiro Kawaoka of the Univer- 
sity of Wisconsin-Madison and his colleagues 
have submitted similar work to Nature, which 
is partially described in an online Comment 
published this week’, 

Acting on advice from the NSABB, the 
US government last month asked Science and 
Nature to publish only the broad conclusions of 
the two studies, and not to reveal the scientific 
details, in order to limit the risk that uncon- 
trolled proliferation of such research might lead 
to accidental or intentional release of similar 
mutant viruses. The journals and the authors 
have agreed to this redaction, provided that a 
mechanism is established to disseminate the 
data to flu researchers and public-health offi- 
cials on a need-to-know basis. The US govern- 
ment, the World Health Organization (WHO) 
and other bodies are now trying to put this 
mechanism together, along with a framework 
for international oversight of such research. 

Last week, in a statement jointly published in 
Nature and Science’, 39 flu researchers declared 
a 60-day pause in the creation of lab mutant 
strains of the H5N1 avian flu virus. The hiatus, 
they hope, should give scientists and policy- 
makers time to debate how such research might 
best proceed, and what safety measures should 
be required of labs that 
handle the virus. The sig- 
natories to the statement, 
including the key authors 
behind the controversial 
research, plan to bring 
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> together some 50 experts ata WHO-hosted 
meeting in Geneva, Switzerland, next month to 
discuss these thorny issues. 

Scientists contacted by Nature say that basic 
research on such mutated strains may eventually 
yield insight relevant to developing pandemic 
countermeasures such as drugs and vaccines. 
And they all agree that the new research has 
done the world a service by showing that HSN1 
seems capable of evolving the ability to spread 
rapidly among humans, in contrast to what 
some scientists have claimed. 

Fouchier’s study “raises a red flag’, says Iaria 
Capua, an animal-flu expert at the Experimen- 
tal Animal Health Care Institute of Venice in 
Legnaro, Italy. “That is the real, and most impor- 
tant message, of this whole exercise.” It should 
prompt donors and international organizations 
to ramp up their funding of efforts to control 
outbreaks of the H5N1 virus in poultry, and so 
give the virus fewer opportunities to evolve into 
a human pathogen, she says. Other scientists 
add that it should force governments to rethink 
existing vaccine technologies, which are only 
capable of supplying vaccine six months after a 
pandemic starts, and of producing enough vac- 
cine for a small fraction of the world population. 


PATCHY SURVEILLANCE 
But the notion that the research offers a guide 
to dangerous variants that could be stamped 
out before they spread is unrealistic, say 
Osterholm and other researchers. “In order to 
even consider the possibility of reducing the 
animal reservoir of an emerging pandemic 
virus, one would need rapid and complete 
detection of virus in all geographical areas,” 
Osterholm says. Yet surveillance of H5N1 in 
poultry worldwide is patchy, particularly in 
poorer countries, where the virus is prevalent. 
It is also largely geared towards simply detect- 
ing and monitoring outbreaks, and few of the 
viral samples collected are ever sequenced. 

Last year, global surveillance resulted in 
partial sequences from just 160 H5N1 isolates 
being submitted to GenBank, the main reposi- 
tory of such data (see ‘Sequence shortfall’). And 
virus isolates are often sequenced months or 
years after they are collected — hardly the swift 
turnaround of a pandemic alert system. “Could 
we pick up a mutation in real time and stop a 
pandemic?” asks Capua. “Not with the surveil- 
lance we have now.” 

Moreover, if H5N1 surveillance in poultry 
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is poor, the situation is far worse in pigs, where 
there is almost no systematic surveillance, even 
in richer countries. HSN1 infections in pigs are 
uncommon and cause only mild illness, creat- 
ing little economic incentive to monitor them’. 
GenBank contains partial sequences from just 
24 pig H5N1 isolates. Yet pigs are a likely source 
of a human pandemic H5N1 virus because 
they are susceptible to both human and avian 
viruses, creating opportunities for genetic reas- 
sortment in co-infected animals. 


SEQUENCE SHORTFALL 


The number of H5N1 virus isolates sequenced and 
deposited in the GenBank database has waned in 
recent years. It holds very few sequences from pigs, 
a likely source of any human pandemic. 
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Fouchier argues that many countries 
collect more, and more-timely, sequence data 
than those deposited in GenBank. “That some 
outbreak countries are not yet fully capable of 
making optimal use of such data should not 
lead to the fatalistic conclusion not to gener- 
ate and share the data as they emerge,” he says. 
“Warnings weeks after dangerous viruses have 
emerged in poultry, or mammals, may be better 
than no warnings at all.” 

But even if a candidate pandemic H5N1 
virus was detected in poultry, culling flocks to 
eliminate it would be no mean feat. H5N1 has 
become endemic in many countries, including 
China, Vietnam, Indonesia, Bangladesh, India 
and Egypt, and the United Nations’ Food and 
Agriculture Organization estimates that, with 
current resources, it would take at least a decade 
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to stamp out the virus in such countries. 

The relative ease of making H5N1 trans- 
missible between mammals in the lab should 
now prompt the world to address these glar- 
ing inadequacies in surveillance, says Jeremy 
Farrar, director of the Oxford University 
Clinical Research Unit in Ho Chi Minh City, 
Vietnam. Molecular technologies need to be 
made more easily available and affordable to 
countries at risk, and genetic surveillance more 
comprehensive and timely. But building such 
systems would require sustained political will, 
financial resources, and overcoming major 
logistical hurdles in the field. 


OTHER MUTATIONS 

Even if such a revolution occurred, looking 
only for the mutations reported by Fouchier 
and Kawaoka would be short-sighted, says 
Marc Lipsitch, an epidemiologist at the 
Harvard School of Public Health in Boston, 
Massachusetts. “There are many, many, other 
ways that the virus could become transmissi- 
ble,’ he says. “It would be very unfortunate if 
people say that if we dont see these mutations, 
we dont need to worry,’ he adds. Moreover, says 
Farrar, H5N1 is far from being the only flu virus 
that poses a pandemic threat. But he believes 
that more extensive genetic surveillance could 
eventually pay off. “The research points us to 
where we need to go, rather than where we are 
today,” he says. “Before this research, we were 
all guessing what changes might be needed. 
This work pushes that forward” 

Asked whether there were any areas in which 
the mutant flu research could provide imme- 
diate public-health benefits, Anthony Fauci, 
director of the US National Institute of Allergy 
and Infectious Diseases, replies: “I would say 
that, in a perfect world, the immediate benefit 
would be in surveillance. But from a logistics 
standpoint, in the world we live in, that will be 
difficult to do” 

Because the value of these studies is more 
likely to emerge in the longer term, it makes 
sense to take time to consider how such 
research can proceed safely, he says. “What's 
the rush?” he asks. “I am very much in favour 
of having a pause in the research.’ = 


1. Nature 480, 421-422 (2011). 

2. Kawaoka, Y. Nature http://dx.doi.org/10.1038/ 
nature10884 (2012). 

3. Fouchier, R.A. M. et al. Nature 481, 443 (2012). 

4. Nature 459, 894-895 (2009). 
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Researchers are studying two ways to fuse a diseased egg 
with a healthy one to prevent rare mitochondrial diseases. 


TWO WAYS TO CRACK AN EG 


Maternal spindle transfer 


Spindle complex 


These are added to an 
unfertilized donor egg 


The chromosomes are 
removed from an unfertilized 
egg that has mutated 


mitochondrial DNA. removed. 


Pronuclear transfer 


An egg with mutated 
mitochondrial DNA is 
fertilized in vitro. 


The resulting 
pronucleus is 
removed. 


EMBRYOLOGY 


Normal mitochondrion 


that has had its nucleus 


149) >( 


Pronucleus 


The egg develops 
normally to form an 
embryo. 


This fused egg is 
fertilized in vitro. 


The fused egg 
develops normally. 


This is transferred to a 
fertilized donor egg that has 
had its pronucleus removed. 


UK sets sights on 
gene therapy in eggs 


Public consultation and safety assessment would pave the 
way for embryo manipulation to treat genetic diseases. 


BY EWEN CALLAWAY 


ritain has set out a road map towards the 
B first clinical tests of reproductive tech- 
niques that combine parents’ genes with 
DNA froma third party. The approach raises 
ethical questions, but could spare children from 
inheriting some rare diseases, including forms 
of muscular dystrophy and neurodegenerative 
disorders that affect around 1 in 5,000 people. 
These conditions are caused by defects in 
the mitochondria, the ‘power packs’ of the 
cell, which are inherited from a child’s mother 
through the egg. Experiments on primates, 
and with defective human eggs, have already 
shown that genetic material can be removed 
from an egg that has faulty mitochondria and 
transferred to a healthy donor ovum, leaving 
the flawed mitochondrial DNA behind. In 
principle, the resulting egg could then develop 
into a healthy child carrying both the parents’ 
nuclear genes and mitochondrial DNA from 
the donor. But the work amounts to genetic 
modification of embryos — which is currently 
illegal in the United Kingdom — and also 
involves destroying fertilized eggs. 
On 19 January, the UK government's Human 
Fertilisation and Embryology Authority 


(HFEA) announced a public consultation on 
the process, the first step towards making it 
legal. Simultaneously, the country’s biggest 
biomedical charity, the Wellcome Trust, said 
that it would fund preclinical experiments to 
gauge the safety of the techniques. An inde- 
pendent bioethical review is also in progress. 
“It’s a wonderful example of how regulation 
should work, because it’s saying let’s see the sci- 
ence, let’s see the bioethics, let’s find out what 
the public thinks,” says Peter Braude, a repro- 
ductive biologist at King’s College London. 

Two procedures are being developed: pro- 
nuclear transfer and maternal spindle transfer 
(see “Two ways to crack an egg’). US research- 
ers have already used maternal spindle trans- 
fer to produce two healthy rhesus monkeys 
(M. Tachibana et al. Nature 461, 367-372; 
2009). Meanwhile, neurologist Douglass Turn- 
bull of Newcastle University, UK, and his team 
have performed pronuclear transfer on defec- 
tive human eggs, and found that normal devel- 
opment occurred in a small minority (L. Craven 
et al. Nature 465, 82-85; 2010). 

In 2011, Braude co-authored a report for the 
HFEA that concluded that the nuclear transfer 
procedures seem safe on the basis of current 
research. The report outlined several studies 
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that it deemed “critical” before either procedure 
could be used to create a child. These include 
experiments on healthy human eggs, and proof 
that the pronuclear transfer technique can con- 
ceive healthy monkeys. 

The Wellcome Trust has now awarded 
Turnbull £4.4 million (US$6.8 million), which 
his university has topped up with another 
£1.4 million, to perform the procedures on 
healthy human eggs. Turnbull and his team 
hope to determine whether the embryos can 
safely reach the 100-cell blastocyst stage, when 
implantation occurs. Yet they have no plans to 
test pronuclear transfer in monkeys, and it is 
unclear who would conduct such experiments. 

Shoukhrat Mitalipov, a reproductive biolo- 
gist at Oregon Health and Science University 
near Portland, who led the work on rhesus 
monkeys, says that his lab uses the maternal 
spindle transfer technique because it is easier to 
perform. Mitalipov has begun experiments on 
human eggs in a privately funded lab. Such work 
is barred from receiving federal funds because 
it involves the destruction of human embryos. 


NO SCRAMBLE 

Turnbull is agnostic about which, if any, 
procedure is likely to reach the clinic first. One 
technique may prove more efficient than the 
other, or may shuttle fewer mutant mitochon- 
dria to the donor egg. “We've got to be careful 
that this is nota race,’ Turnbull says. Taking the 
technique beyond the lab would also require a 
change in the law. Nuclear transfer procedures 
are banned under the UK Human Fertilisation 
and Embryology Act, but the government 
could amend the law for procedures used to 
treat mitochondrial diseases. 

Meanwhile, the Nuffield Council on Bioethics 
in London has begun an independent review 
of the procedures. Science writer Geoff Watts, 
chairman of the working group behind the 
review, believes that it will inform the govern- 
ment’s decision. It will consider issues such as 
whether the procedures should be used to con- 
ceive only males, who will not pass on any donor 
mitochondria to their children. The council says 
that it will deliver a report in the summer. 

Braude worries that, outside the United King- 
dom, “somebody will be a cowboy” and attempt 
the procedures without regulatory oversight. Yet 
government hurdles could prevent the tech- 
niques from being adopted quickly elsewhere. 
In Australia, for example, a government review 
in 2011 recommended continuing a ban on such 
techniques. In the United States, the Food and 
Drug Administration would be responsible for 
approving them. Such restrictions, combined 
with the US ban on federal funding of human 
embryo work, mean that the first clinical trials 
of nucleus-swapping procedures will almost 
certainly occur in the United Kingdom. 

“We've moved to the stage where we think 
that the preliminary evidence is very support- 
ive,’ says Turnbull, “and we think it's a very good 
idea that they consider this.” m SEE EDITORIALP.410 
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SCIENCE INVESTMENT 


Research in Asia heats up 


US indicators reveal challenges and opportunities as science momentum shifts to China. 


BY EUGENIE SAMUEL REICH 


t is a mantra that plays readily to US 

competitive fears: Asia, led by China, is on 

track to displace the United States as the 
world’s science and technology powerhouse. 
That message is loud and clear in the 2012 edi- 
tion of Science and Engineering Indicators, a 
nearly 600-page snapshot of the state of global 
research that looks at education, academic 
infrastructure, the knowledge-based work- 
force and international markets. Yet some 
policy experts say that the trends reveal oppor- 
tunities for partnerships that could benefit the 
United States. 

“Our country needs to worry about science 
and innovation when so much is being done 
out of the country. Long-term, this might be 
harmful to our competitiveness,” says Ray 
Bowen, chairman of the US National Science 
Board, which produced the report and over- 
sees one of the US government's main research 
funding agencies, the National Science Foun- 
dation (NSF). 

The report, released on 17 January, finds that 
by 2009, the combined research and develop- 
ment (R&D) investment from a group of ten 
Asian economies including China and India, 
had caught up with that of the United States 
(see ‘Rising influence’). “One hopes that the 
new data will help to reinforce the message that 
the US government (as well as industry) needs 
to keep R&D investments at the top of its pri- 
orities, despite current fiscal constraints,” says 
Claude Canizares, vice-president for research 
at the Massachusetts Institute of Technology 
in Cambridge. 

Yet Ron Hira, an engineer and science- 
policy expert at the Rochester Institute of 
Technology in New York, says that the real 
problem is that the United States invests too 
little in R&D, but that its ability to channel 
that investment into high-tech manufactur- 
ing has been declining since 2000. Hira says 
that India and China have been more pro- 
active in protecting and fostering high-tech 
industries. In the United States, “we have a 
surplus of researchers doing endless postdocs’, 
he says. “The worry is how that translates into 
economic growth” 

Henry Sauermann, who studies science and 
innovation at the Georgia Institute of Technol- 
ogy in Atlanta, says that the United States may 
need to alter its perspective. The indicators 
suggest that intellectual property and trained 
personnel are increasingly likely to come from 
Asia and developing countries. The United 


RISING INFLUENCE 


Research and development investment in ten Asian economies has tripled in the past 15 years ([M), as has 
the number of journal articles ([). Collaborations between US and Asian researchers have been roughly 
constant since 2000 ([E]), suggesting that the United States could do more to take advantage of Asian 
research. The trend looks set to continue, with the numbers of science degrees in China rising rapidly (4). 


Bl Spending on research 
and development 


— United States — Asia-10* 
— European Union == Rest of world 


450° 


Expenditure (US$ billions) 


1996 1998 2000 2002 2004 2006 2008 


EE] US collaborations 


= Taiwan =—= China — European 
— South Korea —= India Union 


— Japan — = Singapore 


Index (1.0 is expected number of collaborations) 


2000 2002 2004 2006 2008 


2010 


FA Number of science and engineering 
journal articles produced 


— United States == Asia-10* 


— European Union == Rest of world 


200 - 


150 


ournal articles (thousands) 


100 


J 


50.2 


0Tnnnwnn»®n>ooo 
1995 1997 1999 2001 2003 2005 2007 2009 
[EY Degrees in natural sciences 


= United States == United States 
first degrees doctoral degrees 


== China doctoral 
degrees 


== China first 
degrees 


Number of degrees (thousands) 


oa 
lo} 


8 oa es co ee 
2000 2004 2008 


p= 


2000 2004 2008 


*Asia-10: China, India, Indonesia, Japan, Malaysia, Philippines, Singapore, South Korea, Taiwan, Thailand 


States should take advantage of Asian ideas 
and expertise rather than thinking of itself as 
a producer of value that may be lost, says Sau- 
ermann. For example, US researchers could 
make greater efforts to read Chinese publica- 
tions and collaborate with Chinese institu- 
tions. “Increasingly we see China paying for 
research and putting it out there for everyone 
to use. That’s an opportunity.” 

Caroline Wagner, a science-policy expert 
at Ohio State University in Columbus, says 
that it is important to be aware that much of 
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the science done in developing countries is 
not captured by the report, which covers only 
research published in journals that are indexed 
by Thomson-Reuters — fewer than 10% of the 
world total, she estimates. The remainder is 
likely to be in languages other than English, 
and benefits its country of origin — through 
research on Brazilian fruit trees or fungus in 
China, for example — but not necessarily other 
members of the global economy, such as the 
United States. “There's an awful lot of unseen 
science,’ she says. m 


SOURCE: NSF 
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Mitt Romney (left) and Newt Gingrich have shifted their views on the threat posed by climate change. 
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Candidates play to 
the right on science 


Rivals for the Republican nomination laud research but 
take a hard line on embryonic stem cells and climate. 


BY SUSAN YOUNG 


S presidential elections do not tend 
| to revolve around science. But thanks 
to the latest twist in the race for the 
Republican candidacy, the unusual topic of 
mining the Moon looks set to be up for dis- 
cussion. Lunar resources and the space pro- 
gramme in general are popular themes with 
former US House speaker Newt Gingrich. Fol- 
lowing his decisive victory over the Republican 
frontrunner, former Massachusetts governor 
Mitt Romney, in the South Carolina primary 
on 21 January, Gingrich — whose fortunes 
have zigzagged during the campaign — is back 
in the media spotlight. 

Romney and Gingrich now dominate an 
unsettled contest. Both are taking staunchly 
conservative positions on controversial sci- 
ence issues: they are against regulating carbon 
emissions and oppose embryonic stem-cell 
research. Yet both have held more moderate 
views in the past. And, with concern grow- 
ing about unemployment and international 
competition (see page 420), the state of US 
research and its role as an economic driver 
could have more resonance than usual in this 
year’s presidential race. 


“The public supports science strongly,’ says 
Mary Woolley, president of the medical-science 
advocacy group Research! America, based in 
Alexandria, Virginia. “They can connect the 
dots between science and better jobs for them- 
selves, their children, their grandchildren. They 
don’t want to be part of a country that’s not mak- 
ing progress in science at the level we used to” 

Last November, the group sent a list of 
research-oriented questions to all candidates, 
including US President Barack Obama, against 
whom the winner of the Republican contest 
will face offlater this year. So far, only Gingrich 
and Obama have responded. In keeping with 
Republican ideology, Gingrich credits the US 
private sector as the key driver of technical inno- 
vation and success. But his tone is urgent. “We 
are on the cusp of an explosion of new science 
that will create new opportunities in health, 
agriculture, energy, and materials technology,’ 
he writes. “We must move quickly and decisively 
so we will be the creators of this innovation, 
not just the recipients of it.” Now, with the race 
moving on to the Florida primary on 31 January, 
Gingrich has promised to deliver a visionary 
speech on the US space programme, in the tra- 
dition of Democratic president John F. Kennedy. 

Yet, on stem cells, Gingrich leans firmly > 
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> to the right, mindful that many social con- 
servatives in the United States equate research 
on human embryos with abortion. In 2009, 
Obama lifted a ban imposed by former Repub- 
lican president George W. Bush that limited 
federal funding for human embryonic stem- 
cell research to just a handful of existing cell 
lines. By contrast, Gingrich says he would 
“oppose at every turn any process of destroy- 
ing embryos”. That could mean a freeze on the 
approval of new lines, or possibly an end to 
federal funding for such work altogether. 

Romney’s history on the issue is more com- 
plex. As a candidate for governor in 2002, he 
voiced general support for stem-cell science. 
But, once in office, he vetoed a 2005 bill that 
allowed cloning to create human embryonic 
stem cells for research. Romney was overruled 
by the state's legislature, but the following year 
his administration set down regulations that 
could have criminalized the work of scientists 
using human embryonic stem cells. However, 
no scientists were prosecuted under the regula- 
tions, and the bill was amended when the next 
governor took over. 

The positions that Romney and Gingrich 
now espouse suggest that if either were to be 


in the White House with a Congress similar to 
today’s, US stem-cell policy would take a hard 
right turn. “It would be very likely that there 
would be a reduction or elimination of fund- 
ing for embryonic stem-cell research’, says Alta 
Charo, a professor of law and bioethics at the 
University of Wisconsin-Madison. 
Romney’s stance 


“Any time a on climate has also 
scientist stands _ shifted. In 2004, he 
up in a public issued a ‘climate pro- 
forum and says tection plan’, with 
climatechangeis targets for reduc- 
realandweneed _ ing carbon dioxide 
todo something emissions from the 
about it, there state’s power plants. 
are immediate However, by Decem- 


ber 2005 he backed 
away, on economic 
grounds, from a regional initiative in which 
several northeastern states banded together 
to reduce emissions. 

Gingrich’s turn on climate has been even 
more abrupt. In 2008, he teamed up with Dem- 
ocrat and then House speaker Nancy Pelosiina 
television advert promoting political action on 
climate change. He has since been chastised for 


repercussions.” 
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this by conservative commentators. Gingrich 
now says he regrets the ad and, in recent weeks, 
has further distanced himself from his proactive 
former stance. After he was criticized on right- 
wing talk radio for involving Katharine Hayhoe, 
a climate scientist at Texas Tech University in 
Lubbock, in a book project on environment 
entrepreneurship, Gingrich said he would 
drop Hayhoe’s chapter from the book. Hayhoe 
supports the idea that human activity is driving 
climate change. “Any time a scientist stands up 
ina public forum and says climate change is real 
and we need to do something about it, there are 
immediate repercussions,’ Hayhoe says. 

Yet whoever becomes the Republican can- 
didate — and whoever ultimately becomes 
president — these disputes may not mean 
much for the support of science as a whole. 
Since the cold war, both Republican and 
Democratic administrations have a steady 
record of support for basic science. And with 
the US electorate focused on the economy and 
unemployment, it seems likely that if science is 
discussed at all in this year’s campaign it will be 
in the context of jobs and competitiveness — 
issues on which all candidates, no matter what 
their ideology, see a need for action. m 


French institute prepares 
for gene-therapy push 


Genethon relaunches itself as a force for translational medicine. 


BY ALISON ABBOTT 


r Vhe French biotechnology institute 
Genethon is perhaps best known for 
its unusual funding source — annual 

television appeals — and for its mapping 

of the human genome in the early 1990s 

(I. Chumakov et al. Nature 359, 380-387; 

1992). Now, after spending years in the sci- 

entific doldrums, it plans to become known 

as the European centre that can speed up the 
process of getting gene therapy for rare genetic 
disorders into routine clinical practice. 
Scientific director Fulvio Mavilio, a molecu- 
lar biologist who took office earlier this month, 
has a mandate to sharpen the research profile of 
the institute, which employs 180 scientific staff 
at its base just outside Paris. One of his main 
strategies is to create international clinical net- 
works for gene therapy around Genethon. The 
institute should become particularly attractive 
to international partners later this year, when it 
opens what will be the world’s biggest plant for 


producing large volumes of clinical-grade viral 
vectors — used to transfer therapeutic genes 
into the cells of patients. 


Fulvio Mavilio is Genethon’s new scientific director. 


“Getting vectors is a bottleneck for us,” says 
Adrian Thrasher, director of the gene-therapy 
programme at University College London’s 
Institute of Child Health, and a Genethon col- 
laborator. “Genethon’s new strategy is realistic.” 

The first gene-therapy trials involved a 
handful of children in Italy and France who 
had rare and fatal immunodeficiency disor- 
ders, and showed that healthy genes could be 
transferred stably into patients to reverse their 
symptoms. 

The early successes spawned a period of 
hype that came to an abrupt end with the 
1999 death of Jesse Gelsinger, a teenager 
in the United States who had a profound 
immune reaction to his gene therapy, as well 
as the emergence of cancers in some immune- 
deficient children who had been treated. 
Progress has been slow and cautious ever since. 
More than a thousand proof-of-principle clini- 
cal studies have been done around the world 
(see ‘Gene promise’), and dozens have shown 
positive results, but as yet no form of gene > 


26 JANUARY 2012 | VOL 481 | NATURE | 423 


© 2012 Macmillan Publishers Limited. All rights reserved 


| NEWS | IN FOCUS 


> therapy has been approved for routine use 
by the US Food and Drug Administration or 
the European Medicines Agency. That situa- 
tion must change, according to the board of 
directors of the French Muscular Dystrophy 
Association, which created Genethon in 1990 
and has funded it ever since through its annual 
telethons. 


BROAD POTENTIAL 

Although the best-known gene-therapy trials 
have been done in children with immune defi- 
ciencies, the technique could tackle a much 
wider array of diseases. Most of the current 
clinical studies are in cancer, with researchers 
trying to introduce genes that will kill the can- 
cer cells directly, or prod the patient’s immune 
system into attacking them. And on 6 January, 
the American Society of Gene and Cell Ther- 
apy sent the director of the US National Insti- 
tutes of Health a list of the diseases it believes 
will benefit most in the next six years from 
investment in translating basic research to 
the clinic. It included rare immunodeficiency 
and eye disorders, as well as more common 
blood disorders, two cancers and Parkinson's 
disease. 

Mavilio worked on the world’s first gene- 
therapy trial, which treated children with the 
immunodeficiency disorder ADA-SCID at 
the San Raffaele Scientific Institute in Milan. 


GENE PROMISE 


Dozens of gene-therapy clinical trials are now 
approved by regulatory authorities every year. 


Another 141 trials have been approved at unspecified times. 


Pharmaceutical giant GlaxoSmithKline last 
year forged a multi-million-euro alliance 
with the institute to develop similar gene 
therapy for rare diseases, becoming the first 
pharmaceutical giant to invest significantly 
in the field. 

“The disadvantage of Genethon compared 
to the San Raffaele is that it does not have its 
own hospital,” Mavilio says. “We will be very 


proactive in forging collaborations with top 
clinicians in Europe and beyond so that we 
can become a major hub for gene-therapy 
networks.” 

The vector-production facility will be 
a huge asset for this, he says. “But it won't 
be enough. To get good collaborations, we 
also have to be known as a force in science.” 
Genethon moved away from basic research 
in 2006, to focus on vector production. It 
now gets more than 90% of its money from 
the Telethon, but Mavilio wants this to be 
significantly supplemented by competitive 
research grants. 

Philippe Moullier, director of the French 
national biomedical research agency’s gene- 
therapy unit in Nantes, and a Genethon 
adviser, warns that the firm needs “to remain 
humble and move slowly — I don’t know if 
we can demand to become a European hub”. 
Thrasher, at least, is enthusiastic about the insti- 
tute’s ambitions, predicting that “Genethon will 
probably become our first port of call”. m 


CORRECTION 

The photo caption in the News story 
‘Gemini’s twin telescopes reboot’ (Nature 
481, 251; 2012) incorrectly identified the 
Gemini North telescope as Gemini South. 


SOURCE: J. GENE MED. 
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HUNT FOR THE 


ANCIENT MARINER 


Armed with high-tech methods, researchers are scouring the 
Aegean Sea for the world’s oldest shipwrecks. 


BY JO MARCHANT 


rendan Foley peels his wetsuit to the waist and perches on the 

side of an inflatable boat as it skims across the sea just north 

of the island of Crete. At his feet are the dripping remains of 

a vase that moments earlier had been resting on the sea floor, 

its home for more than a millennium. “It’s our best day so 
far,’ he says of his dive that morning. “We've discovered two ancient 
shipwrecks.” 

Foley, a marine archaeologist at the Woods Hole Oceanographic 
Institution in Massachusetts, and his colleagues at Greece’s Ephorate 
of Underwater Antiquities in Athens have spent the day diving near 
the cliffs of the tiny island of Dia in the eastern Mediterranean. They 
have identified two clusters of pottery dating from the first century BC 
and fifth century ap. Together with other remains that the team has 
discovered on the island’s submerged slopes, the pots reveal that for 
centuries Greek, Roman and Byzantine traders used Dia as a refuge 
during storms, when they couldnt safely reach Crete. 

It is a nice archaeological discovery, but Foley was hoping for some- 
thing much older. His four-week survey of the waters around Crete last 
October is part ofa long-term effort to catalogue large numbers of ancient 
shipwrecks in the Aegean Sea. And the grand prize would bea wreck from 
one of the most influential and enigmatic cultures of the ancient world 
— the Minoans, who ruled these seas more than 3,000 years ago. 

Some researchers believe that quest to be close to impossible. But 
Foley and a few competitors are using high-tech approaches such as 
autonomous robots and new search strategies that they say have a 
good chance of locating the most ancient of shipwrecks. If they suc- 
ceed, they could transform archaeologists’ understanding of a crucial 
period in human history, when ancient mariners first ventured long 
distances across the sea. 
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Archaeologists have precious little information about the seagoing 
habits of the Minoan civilization, which erected the palace of Knossos 
on Crete — linked to the Greek myth of the Minotaur. Minoans far 
exceeded their neighbours in weaponry, literacy and art, and formed 
“part of the roots of what went on to become European civilization’, 
says Don Evely, an archaeologist at the British School at Athens, and 
curator of Knossos. Archaeologists are keen to understand what made 
the Minoans so successful and how they interacted with nearby cultures 
such as the Egyptians. 

Although researchers have studied scores of Roman ships, finding 
a much older Minoan wreck “would add 100% new knowledge’, says 
Shelley Wachsmann, an expert in ancient seafaring at Texas A&M Uni- 
versity in College Station. 


UNDERWATER TREASURE 

A Bronze Age wreck called Ulu Burun shows how the remains of a 
single ship can transform archaeologists’ understanding of an era. Dis- 
covered in 1982, it lies about 9 kilometres southeast of Kas in southern 
Turkey, and dates from around 1300 Bc, a century or two after the 
Minoans disappeared. 

Christos Agourides, secretary-general of the Hellenic Institute of 
Marine Archaeology in Athens, describes it as “the dream of every 
marine archaeologist” It took ten years to excavate, and researchers are 
still studying the nearly 17 tonnes of treasures recovered. The vast cargo 
includes ebony, ivory, ostrich eggs, resin, spices, weapons, jewellery and 
textiles as well as ingots of copper, tin and glass. 

But what really stunned archaeologists was that the artefacts on 
this one vessel came from at least 11 different cultures’ — from a gold 
scarab bearing the name of the Egyptian queen Nefertiti to copper from 
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Cyprus and tin from central Asia. 

The wreck provided tangible evidence of an astonishing array of con- 
tacts and trade between the different cultures of the Mediterranean and 
Near East in the late Bronze Age. The Ulu Burun ship sailed at around 
the time that Tutankhamun ruled Egypt, and “it is far more important 
than Tutankhamun’s tomb as a contribution to our understanding of 
the period’, according to Wachsmann. “This goes to the nitty gritty of 
the world. It’s Wall Street in a ship” 

The earlier Minoans set the stage for such a widespread trading net- 
work through their domination of the eastern Mediterranean. Their 
seafaring abilities were still celebrated 1,000 years later by Greek histo- 
rian Thucydides, who credited the Minoans with building the world’s 
first navy and ridding the seas of pirates. Although other contemporary 
Mediterranean cultures were starting to travel across the sea, the Mino- 
ans ventured farther than others, reaching distant ports in Syria, Cyprus, 
the Cyclades and Egypt (see map). Wachsmann describes them as the 
“Christopher Columbuses of the Bronze Age”. 

Researchers have already found one potential Minoan wreck site by 
the island of Pseira, off the northeast coast of Crete. In 2003, archaeologist 
Elpida Hatzidaki of the Ephorate of Underwater Antiquities discovered 
a large collection of underwater pottery dating to around 1800 Bc. 

But at this site and a few even older ones, no portion of the ship itself 
survives, and it is hard to determine whether the pottery came from a 
wreck, was simply thrown overboard, or washed into the sea from the 
nearby coast. Even those who believe the Pseira site does represent a 
Minoan wreck admit that the pottery itself — everyday ware of local 
origin — doesn’t reveal much new information. What archaeologists 
crave is an equivalent of Ulu Burun, a long-distance trading ship packed 
with valuable cargo that would reveal how different cultures interacted. 
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“Ships were the way that people communicated and moved about the 
ancient world, says Foley. “So if we can find these ancient wrecks, we get 
a much clearer view of the very dim past.” 

That dream lured Foley and his team to Crete last year, and they 
brought a new tool that they hope will significantly raise the chances of 
finding an ancient shipwreck. In the past, archaeologists have explored 
the sea floor using divers and, more recently, remotely operated vehicles 
(ROVs) that are controlled by pilots on ship. Foley's team tested an auton- 
omous diving robot that could search the ocean bottom for hours under 
its own command. The REMUS 100 vehicle (for Remote Environmental 
Monitoring Underwater System) is equipped with Global Positioning 
System technology, side-scan sonar and a video camera. The Woods 
Hole researchers worked on the project with Greek archaeologists led 
by Theotokis Theodoulou of the Ephorate of Underwater Antiquities. 


ROBOT ROVERS 

The torpedo-shaped robot, nicknamed Gudgeon after a Second World 
War submarine, spent the first month of the field campaign surveying 
the entire sea floor north of Crete’s main harbour, Heraklion, for any 
lumps and bumps that might signal an ancient wreck. 

Foley had high hopes for the area because it had been a port for mil- 
lennia and had never been surveyed by archaeologists. But the search 
came up empty handed. Close to shore, there was no hope of finding 
ancient wrecks because the sea floor was covered in a thick pile of sedi- 
ments that had washed off the island. Farther out, the researchers found 
furrows left by trawl fishermen, who had scraped the sea floor clean, 
even in areas where trawling is supposedly forbidden. 

So Foley’s team moved its search to Dia, which lies just north of 
Heraklion. In 1976, the ocean explorer Jacques Cousteau found some 
ancient remains there, and Foley suspected that Dia might be a fertile 
site for shipwrecks because its steep cliffs could be lethal to vessels 
caught ina storm. 

The team took a two-pronged approach to exploring around Dia. 
The Gudgeon crew prowled Dia’s bays, where the ocean bottom is 
smooth and artefacts are more likely to show up in sonar images. Near 
shore, where the bottom is too rocky for Gudgeon, Foley and his team 
of divers made a circuit of the bays at about 40 metres depth. 

Almost immediately, the divers located five ancient wrecks, rang- 
ing from around the second century BC to the ninth century ap. The 
discoveries confirmed Cousteau’s impression that now-deserted Dia 
was used for centuries as an anchorage. And Foley was convinced that 
the Minoans must have been here too, with the evidence perhaps on 
the deeper floor of Dia’s bays. But Gudgeon’s sonar images from those 
sites kept coming back disappointingly clear. 

On the penultimate day of the field season, Greg Packard and Mark 
Dennett of Woods Hole stood on the stern of their small research ves- 
sel, and swung Gudgeon overboard. The miniature explorer descended 
to the bottom and spent the morning cruising back and forth along 
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preprogrammed gridlines. Later that evening, when Packard examined 
the sonar data, he spied a potential target — a patch of bright speckles 
amid the smooth dark image. The team debated whether it could be a 
heap of pottery on the sand. 

The next day, Foley took his crew of divers out to the suspect site. 
Some 15 minutes later, they came back with disheartening news: the 
sonar signal was a collection of plastic water bottles that must have been 
dumped overboard from a modern boat. And footage from Gudgeon’s 
video camera explained the absence of archaeological remains — fur- 
rows in the sand showed that trawlers had cleaned out even these tiny 
bays. Ifa Minoan ship ever sank here, it has long since been destroyed. 
“It’s such a waste,” says Foley, clearly disappointed. “I bet theyre not 
even trawling for fish. I bet they're trawling for antiquities.” 


DIVING DEEPER 

Wachsmann says that he isn’t surprised by what Foley saw. From 2007 
to 2009, he led the Danaos project, using sonar-equipped ROVs to sur- 
vey hundreds of square kilometres of sea floor on a suspected ancient 
trading route between Crete and 
Egypt. In three seasons, he didn't 
find a single ancient wreck from 
any period, and only a scattering 
of artefacts. 

Wachsmann found that sedi- 
mentation was a problem even 
far from shore — up to a metre 
per millennium in some areas. 
This means that although some 
Greek and Roman remains 
might still be visible, a Minoan 
ship would be buried under 3 
or 4 metres of sand. And even at 
500-600 metres depth, he saw 
clear evidence of trawling. “It was 
almost like somebody had swept 
the sea in front of me,” he says. 
On the basis of his experiences, 
Wachsmann now believes that 
the chance of finding a Minoan 
equivalent of Ulu Burun “approaches zero”. 

The effect of bottom trawling is “devastating” for archaeologists, 
agrees Robert Ballard, an oceanographer based at the University of 
Rhode Island in Narragansett, who has pioneered deep-sea exploration 
and discovered the wreck of the Titanic in 1985. “Most of the Aegean 
has been destroyed,” he says. 

Ballard has spent years searching for ancient wrecks and says that 
he has learned the importance of finding areas beyond the reach of 
fishermen — below about 600 metres, say, or close to undersea cables, 
which trawlers avoid. He has also opened up his search area. Historians 
once assumed that the number of wrecks in the deep sea was negligible 
because ancient ships must have hugged the coastlines, but in the 1990s 
Ballard found eight ancient wrecks far from shore between the islands 
of Sicily and Sardinia’ (Foley was Ballard’s graduate student at the time). 
“The ancient mariner was not afraid of going out to sea,’ says Ballard. 

Since 2008, Ballard has been exploring the eastern Mediterranean, the 
Aegean and the Black Sea with a suite of ROVs. Although he is finding 
large numbers of ancient wrecks, he hasnt yet uncovered anything from 
the Bronze Age. But, like Foley, he believes Minoan ships are waiting to 
be discovered. The key to finding the oldest wrecks, he says, is locating 
“relic surfaces” that have escaped being buried by sediment, which flows 

downhill and covers the deep sea floor’. “What you 


NATURE.COM = ~— want is a shipwreck that came down on a moun- 
Foraslideshowof —_ tain; he says, because sediment can't accumulate 
images related to on asteep slope. 

this feature, see: Last year, Ballard investigated the Eratosthenes 
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Divers investigate a first-century-AD wreck near the Greek island of Dia. 


Cyprus, and says it does indeed seem to represent a relic surface. He 
is now applying for permits to return to Eratosthenes to search for 
shipwrecks next year. Another area he would like to investigate is the 
submerged Anaximander mountains south of Turkey. It would be dif- 
ficult to distinguish a wreck site from such rocky terrain using sonar, 
so he plans to use video cameras to conduct a painstaking visual search 
over smaller areas. “It’s very hard hunting,’ he says. 

Foley is also now looking to the deep sea, but has a different strategy. 
Instead of targeting particular sweet spots, he wants to cover as large 
an area as possible. He has raised more than US$1 million towards the 
$1.8 million that he needs to return to the Mediterranean next year, 
this time with two of Gudgeon'’s more powerful cousins, REMUS 6000s 
owned by the Waitt Institute in La Jolla, California. 

To maximize the chances of finding ancient wrecks, the team will 
hunt on open, flat areas in the lowest reaches of the sea, up to 6,000 
metres deep. Foley estimates that the two REMUS vehicles can cover up 
to 5,000 square kilometres in one month, equivalent to 1% of the entire 
Aegean Sea. The recent field trial around Dia encouraged Foley because 
it should be easier for the sonar 
surveys to pick out vases than it 
was to find plastic water bottles, 
which are poor sonar reflectors, 
he says. 

Both Ballard and Foley are 
ultimately hoping to use their 
surveys to catalogue large 
numbers of wrecks of all ages 
across great swathes of the 
Mediterranean and the Black 
Sea. Through a combination 
of sonar and high-resolution 
digital photography, they can 
compile detailed three-dimen- 
sional maps of a wreck site and 
answer questions about the date, 
origin and cargo ofa ship without 
bringing up a single artefact. 

Foley estimates that hundreds 
of thousands of ships must have 
sunk in ancient times — including thousands in the Bronze Age alone 
— and that a significant proportion of those are still sitting at the 
bottom of the deep sea. If he’s right, then perhaps researchers will 
eventually have not just one Minoan ship, but hundreds. With enough 
wrecks, says Foley, “it ought to allow us to draw new conclusions about 
this absolutely formative period in human experience.” 

That could shift marine archaeologists into an era in which they can 
use statistical data gathered from hundreds or thousands of wrecks 
to build up a bigger picture of trade routes, migration and warfare 
throughout history. “Wed rather find 500 ships than excavate one,” 
says Ballard. 

Such a dream seems a long way off as Foley’s team packs up its gear at 
the end of its campaign. Packard and Dennett carefully lower Gudgeon 
into a crate for its long trip back to Woods Hole, while Foley eyes one 
of the artefacts he retrieved from Dia’s waters — a bulbous Byzantine 
amphora covered in deposits left by worms. 

It’s not the find Foley hoped for, but he is undaunted — this is just 
the beginning of what he knows could be a long search. “Td like to be 
doing this every year for the next 20 or 30 years,” he says. “Until ’'m 
too old to go to sea. m 


Jo Marchant is author of Decoding the Heavens: Solving the Mystery 
of the World’s First Computer (William Heinemann, 2008). 
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Lab-management software and 
electronic notebooks are here — and 
this time, it’s more thanjust talk. 


BY JIM GILES 


he basement room in the James H. 

Clark Center contains all the trappings 

of a modern imaging laboratory. An 

X-ray scanner hums away in a corner. 
A miniature magnetic resonance imaging 
(MRI) machine, designed to scan the brains 
of mice, sits nearby. It’s the kind of set-up that 
researchers at well-funded institutions such as 
Stanford University in California, the centre’s 
home, have come to expect. 

One piece of equipment, however, is con- 
spicuous by its absence: the humble paper 
notebook. Michelle James uses her iPad to jot 
down notes, check protocols and monitor the 
progress of her experiments on techniques 
for the early detection of Alzheimer's disease. 
Since she first brought the device into the lab 
around four months ago, it has essentially 
replaced her former hardback notebook. 
“Paper has nothing to offer me,’ declares 
James. 


It’s a refrain heard in more and more labs. 
Some groups have ditched notebooks in favour 
of software from Google, such as free-to-use 
tools for sharing documents, spreadsheets and 
calendars. Others are finding that software 
designed specifically for lab workers has evolved 
to the point where it can reliably do a range of 
tasks, from tracking reagent supplies to sharing 
protocols. The era of the paperless lab, decades 
in the making, seems finally to have arrived. 

Now that it’s here, adopters say that a paper- 
less laboratory seems to offer real advantages. 
Bench researchers say that digital note- 
books help them track their experiments in 
more detail. Lab heads report being able to 
follow and focus projects more efficiently 
when documentation is tailored to their 
needs, accessible online and shared among 
colleagues. In some cases, digital notebooks 
could even help researchers to find conn- 
ections or extract results from systematically 
stored data. “The efficiency thing is nice,” 
says Jonathan Hirsch, founder of Syapse, a 
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company based in Palo Alto, California, that 
develops the software James uses to manage 
and share her results. “But understanding your 
data better is what gets people really excited” 


GOING PAPERLESS, AGAIN 

If these claims sound familiar, that may be 
because the paper notebook’s obituary has 
been written many times before. The paper- 
less laboratory was “nascent” ten years ago, 
according to one American Chemical Society 
journal. Electronic notebooks are “ready for 
prime time’, said Douglas Perry, a bioinfor- 
matics expert, in a Nature News Feature from 
2005 (see Nature 436, 20-21; 2005). 

Yet most early notebook-software programs 
had limited impact, often because they weren't 
easy enough to use. Some worked only with 
specific file types, for example, or had cumber- 
some data-entry mechanisms. Such drawbacks 
have not stopped digital notebooks from taking 
off within the pharmaceutical industry, where 
companies have the funds to customize the 


ANNTHEA LEWIS; SHUTTERSTOCK 


systems and can mandate that employees use 
them. But for academics, these shortcomings 
were deal-breakers. “Everything we tried was 
really crappy,’ says Sriram Kosuri, a bioengineer 
at Harvard Medical School (HMS) in Boston, 
Massachusetts, who experimented with many 
of the older packages. 

Things have changed, thanks in part to the 
arrival of free or cheap sharing tools that are easy 
to use and configure. These tools, from storage 
systems such as Dropbox to the products 
offered by Google and others, rely on fast and 
reliable Internet connections and cloud-based 
storage. They provide a quick and cheap way 
to set up a basic lab infrastructure for sharing 
methods, data and other records. 

At William Shih’s synthetic-biology lab at 
HMS, researchers store records on a password- 
protected network of interlinked pages that 
any group member can edit. His team built 
it using Media Wiki, the free-to-use software 
that powers Wikipedia. A page for an ongoing 
experiment, for example, can easily be updated 
to note changes in a protocol or to include a 
graph of the latest results. 

Team members log on to the site through lap- 
tops or their iPads, which they wrap in ziplock 
bags before venturing to the bench. “Even with 
a finger in a glove we can still get touch-screen 
sensitivity, says Shih. When researchers want to 
sketch out an idea — an essential process in a lab 
working on self-assembling nanostructures — 
they use Adobe Ideas, software that, for US$10 
or less, makes it possible to construct detailed 
images on touch-screen devices. 

But these general-purpose tools don't always 
satisfy researchers’ needs. Some note-taking 
software, for example, does not handle tables 
well. And although MediaWiki is very flexible, 
some users say that it has a clunky interface 
for putting in text. This means that the race 
to create a good digital notebook — one that 
is both flexible and tailored to researchers’ 
requirements — is still on. 


GETTING PLUGGED IN 
Jonathan Gross is one of the front runners in 
that competition. Gross began creating lab soft- 
ware when he was a plant-biology researcher at 
the Hebrew University of Jerusalem. The tools 
he built gained a following among his colleagues 
and, in 2007, he quit the lab to found BioData, 
a company based in Rosh Ha‘ayin, Israel, that 
develops lab-management software. (BioData 
was purchased in 2010 by Digital Science, a 
sister company to Nature Publishing Group.) 
Alex Kentsis, a paediatric oncologist and 
haematologist at HMS, is an early adopter of 
Gross’s software. “Prior to BioData we hada 
makeshift operation,” he says. His group used 
to swap files over shared 


NATURE.COM drives, and Kentsis kept 
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project and each row an activity. It worked, but 
only kind of. External collaborators could not 
access the shared drives, and the spreadsheet 
required constant maintenance and was hard 
to interpret. 

BioData’s software, which was relaunched 
last December as Labguru, tackles those prob- 
lems. It's an online home for any information 
associated with scientific research, from pro- 
tocols and results to images and related papers. 
The material is grouped by project, which 
Gross says makes it easy to track progress 


“UNDERSTANDING 
YOUR DATA BETTERIS 
WHAT GETS PEOPLE 
REALLY EXCITED.” 


and to assemble the components of a paper. 
A tagging system makes it possible to group 
experiments that share a common component, 
such as a specific gene or cell line. External 
collaborators can use a password to access a 
specific project, which can be done from any- 
where because the data are stored in the cloud. 

Gross plans to add new functions: he says he 
is in discussions about integrating his system 
with Nature Protocols, a journal dedicated to 
recipes for lab procedures, and Addgene, a 
repository of information on genetic-research 
tools called plasmids. The hope is that Labguru 
users can seamlessly pull information from 
the external databases. “It's about making res- 
earchers lives as easy as possible,’ says Gross. 

Labguru is aimed at life-science labs, but 
there are alternatives for other fields. Andrew 
Phillips, a chemist at Yale University in New 
Haven, Connecticut, uses iLabber, a product 
from Stockholm-based Contur Software. The 
selling point for Phillips was iLabber’s ability to 
handle a chemist’s daily tasks, such as drawing 
molecular structures and calculating expected 
yields. “It’s chemically savvy,’ says Phillips, who 
says he pays $150 per user per year. Labguru 
charges academic labs a similar fee. 

At Syapse, Hirsch also has ambitious plans. 
The company’s software, Syapse Discovery, 
is undergoing testing in around 45 US labs. 
It combines the project-based structure of 
iLabber and Labguru with a layer of semantic 
technology that ‘knows’ the data that research- 
ers upload. For example, the Syapse software 
can automatically recognize that data are from 
a particular lab machine, such as a microarray 
or MRI, and apply ‘time’ and other appropriate 
headings to it. For other data, Syapse Discovery 
has a system that allows researchers to quickly 
select labels from a drop-down list. 
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With the tagging in place, researchers can use 
the software to run complex tasks that would 
generally require coding skills. Users might ask 
it to scan multiple experiments and to build 
a table that combines all results on a specific 
gene. Or, in a clinical setting, they might ask it 
how an experimental drug is affecting patients 
with a particular constellation of symptoms. 
“We want to give people the ability to access 
this information without them having to learn 
programming,’ says Hirsch. 


END OF AN ERA? 

All of this hints at a future in which iPads 
and other devices do more than just replace 
notebooks. Kentsis points to a study of his into 
a therapy for acute myeloid leukaemia. Only 
when he looked at the milestones he had high- 
lighted in BioData did he realize that a key piece 
of evidence — additional results on the extent 
to which drug resistance developed — was 
missing. The task could have been done using 
pen and paper, but BioData made it quicker. 
“Tt’s been easier to see project arcs and to direct 
research towards more important questions,” 
says Kentsis. 

Digital notebooks may also help researchers 
to probe correlations that are too time-con- 
suming to pursue using paper-based records. 
Archana Shenoy, a stem-cell biologist at the 
University of California, San Francisco, has 
been using Syapse Discovery for six months. 
She thinks that it can help to shed light on one 
of the frustrations of her field: cell lines that die 
without apparent cause. 

With a paper notebook it is almost impossible 
to correlate the myriad factors that might affect 
cell survival. But with Syapse Discovery, Shenoy 
can quickly record data ranging from the carbon 
dioxide levels in a cell incubator to the date 
when a new batch of reagents arrives. When 
the search capability is up and running, she will 
be able to look for correlations between these 
factors and the fate of her cells. “It's something 
everybody wonders about,’ she says. “What is 
causing these little changes?” 

Yet it’s unclear how much impact the new 
programs will have. Expense is one issue: not 
every lab can afford to equip its members with 
laptops or iPads. Practicality is another: some 
labs don’t allow laptops near the bench because 
of the risk of spills. But ultimately, these tools, 
like the earlier iterations of digital notebooks, 
will live or die on the basis of their usability. 

The greatest challenge for Syapse, BioData 
and their rivals is not to create a tool that can 
do everything, but one that, like the intuitive 
software produced by Apple, is fundamentally 
easy to use. The initial feedback from James 
and others suggests that the latest generation 
of lab software might be able to do both. If so, 
it might finally be time to turn the page on the 
notebook. = 


Jim Giles is a freelance writer based in 
San Francisco, California. 
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Production at oil fields globally, including at the Kern River oil field in Bakersfield, California, is declining at about 4-6% a year. 


e » e e e 
Oil’s tipping point has passed 
The economic pain of a flattening supply will trump the environment as a reason to 
curb the use of fossil fuels, say James Murray and David King. 


the United States, continuing debates 

about the quality of climate-change 
science and doubts about the scale of nega- 
tive environmental impacts have held back 
political action against rising greenhouse- 
gas emissions. But there is a potentially 
more persuasive argument for lowering 
global emissions: the impact of dwindling oil 
supplies on the economy. 

There is less fossil-fuel production 
available to us than many people believe. 
From 2005 onwards, conventional crude-oil 
production has not risen to match increasing 
demand. We argue that the oil market has 


I: many parts of the world, particularly 


tipped into a new state, similar to a phase 
transition in physics: production is now 
‘inelastic, unable to respond to rising 
demand, and this is leading to wild price 
swings. Other fossil-fuel resources don't 
seem capable of making up the difference. 
Such major spikes in fuel price can cause 
economic crises, and contributed to the 
one the world is recovering from now. The 
future economy is unlikely to be able to 
bear what oil prices have in store. Only by 
moving away from fossil fuels can we both 
ensure a more robust economic outlook and 
address the challenges of climate change. 
This will be a decades-long transformation’ 


that needs to start immediately. 
Production of crude oil increased along 
with demand from 1988 to 2005. But then 
something changed. Production has been 
roughly constant for the past seven years, 
despite an increase in price of around 15% 
per year* (at Brent crude (London) prices) 
from about US$15 per barrel in 1998 to more 
than $140 per barrel in 2008 (see ‘Oil pro- 
duction hits a ceiling’). The price still reflects 
demand: it declined to about $35 per bar- 
rel in 2009 thanks to the 2008-09 recession, 
and recovered along with the upturn in the 
global economy to $120 per barrel before 
declining to its value today of $111. But > 
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> the supply chain has been unable to keep 
pace with rising demand and prices. 

The idea of ‘peak oil’ — that global pro- 
duction will reach a peak and then decline 
— has been around for decades, with aca- 
demics arguing about whether this peak has 
already passed or is yet to come. The typical 
industry response is to point to increasing 
assessments of global reserves — the amount 
known to be in the ground that can be pro- 
duced commercially. But this is misleading. 
The true volume of proven global reserves 
is clouded by secrecy; forecasts by state oil 
companies are not audited and seem to be 
exaggerated’, More importantly, reserves 
often take 6-10 years to drill and develop 
before they become part of supply, by which 
time older fields have become depleted. It 
is far more sensible to look instead at actual 
production records, which are less encour- 
aging. Even while reserves are apparently 
increasing, the percentage available for pro- 
duction is going down. In the United States, 
for example, production as a percentage of 
reserves has steadily decreased from 9% in 
1980 to 6% today”. Production at existing oil 
fields around the world is declining at rates 
of about 4.5% (ref. 4) to 6.7% per year’. Only 
by adding in production from new wells is 
overall global production holding steady. 

In 2005, global production of regular 
crude oil reached about 72 million barrels 
per day. From then on, production capacity 
seems to have hit a ceiling at 75 million bar- 
rels per day. A plot of prices against produc- 
tion from 1998 to today” shows this dramatic 
transition, from a time when supply could 
respond elastically to rising prices caused by 
increased demand, to when it could not (see 
‘Phase shift’). As a result, prices swing wildly 
in response to small changes in demand. 
Other people have remarked on this step 
change in the economics of oil around the 
year 2005, but the point needs to be lodged 
more firmly in the minds of policy-makers. 


EASY ACCESS 
We are not running out of oil, but we are 
running out of oil that can be produced eas- 
ily and cheaply. The US Energy Information 
Administration optimistically projects a 30% 
increase in oil production between now and 
2030 (ref. 2). All of that increase is in the form 
of unidentified projects — in other words, oil 
yet to be discovered. Even if production at 
existing fields miraculously stopped declin- 
ing, such an increase would require 22 million 
barrels per day of new oil production by 2030. 
If realistic declines of 5% per year continue, 
we would need new fields yielding more than 
64 million barrels per day — roughly equiva- 
lent to today’s total production. In our view, 
this is very unlikely to happen. 
Non-conventional oil won't make up the 
difference. Production of oil derived from 
Canada’s tar sands — sometimes called the 


OIL PRODUCTION HITS A CEILING 


Production followed demand until 2005, when it levelled off despite continued price 


increases. There seems to be a production 'cap' at about 75 million barrels per day. 
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The abrupt change in oil economics can be seen in this scatter plot of production versus price. 
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‘oil junkie’s last fix’ — is expected to reach 
just 4.7 million barrels per day by 2035 (ref. 
6). Production from Venezuela's tar sands is 
currently less than 2 million barrels per day’, 
with little prospect of a dramatic increase. 
Many believe that coal will be the solution 
to our energy needs, and will stay cheap for 
decades. But several recent studies suggest 
that available coal is less abundant than has 
been assumed. US coal production peaked 
in 2002, and world coal-energy production 
is projected to peak as early as 2025 (ref. 8). 
Whenever coal-reserve figures are updated, 
the estimates are usually revised downwards: 
estimates of world reserves (79% of which are 
held in the United States, Russia, India, China, 
Australia and South Africa) were decreased 
by more than 50% in 2005, to 861 gigatonnes 
(ref. 9). That study put the ultimate produc- 
tion of coal (the total amount that humanity 
will be able to extract from the ground) at 
1,163 gigatonnes. A 2011 independent esti- 
mate of ultimate production came to just 
680 gigatonnes (ref. 10), some 40% lower 
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than the 2005 figure and about five times less 
than assumed by some older, high-coal-con- 
sumption scenarios of the Intergovernmental 
Panel on Climate Change. The US National 
Research Council’s Committee on Coal 
Research, Technology, and Resource Assess- 
ments to Inform Energy Policy noted in 
2007 that “present estimates of coal reserves 
are based upon methods that have not been 
reviewed or revised since their inception 
in 1974... updated methods indicate that 
only a small fraction of previously estimated 
reserves are actually mineable reserves.” 
Natural gas is still abundant and large 
discoveries have been made recently, notably 
in Israel and Mozambique last year. Power 
plants using natural gas provide 25%, and 
rising, of electricity generation in the United 
States. Production of conventional natural 
gas in North America peaked in 2001 (ref. 2), 
but energy companies have worked hard to 
promote the idea that hydraulic fracturing 
of shale rock will lead to ‘the age of natural 
gas. There is no doubt that US shale-gas 


SOURCE: REF. 2 


resources are immense, but recent reports 
suggest that both reserves and future 
production rates have been substantially 
overstated”. For sites such as the Barnett and 
Fayetteville shales, where a long production 
history can be studied, there has been an 
extremely large annual decline in produc- 
tion rates. Geological consultant Arthur 
Berman, director of Labyrinth Consulting 
Services in Sugar Land, Texas, and a world 
expert on shale gas, has put this decline in 
the range of 60-90%. For shale-gas wells that 
are more than five years old, about 30% are 
sub-commercial because of rapid decline 
combined with the low price of gas. 


STUNTED GROWTH 

What does this mean for the global economy, 
which is so closely tied to physical resources? 
Of the 11 recessions in the United States 
since the Second World War, 10, including 
the most recent, were preceded by a spike in 
oil prices’*. It seems clear that it wasn’t just 
the ‘credit crunch that triggered the 2008 
recession, but the rarely-talked-about ‘oil- 
price crunch’ as well. High energy prices 
erode family budgets and act as a head wind 
against economic recovery. 

The United States and Europe each spends 
$1 billion per day on oil imports. The aver- 
age price of petrol in the United States 
increased from 75 cents per litre in 2010 to 
95 cents per litre in 2011. Because the United 
States consumes about 1.4 billion litres per 
day, the nation spent about $280 million a 
day more on petrol in 2011, leaving less for 
discretionary items. 

Another powerful example of the effect of 
increasing oil prices can be seen in Italy. In 
1999, when Italy adopted the euro, the coun- 
try’s annual trade surplus was $22 billion. 
Since then, Italy’s trade balance has altered 
dramatically and the country now has a 
deficit of $36 billion. Although this shift has 
many causes, including the rise of imports 
from China, the increase in oil price was 
the most important. Despite a decrease in 
imports of 388,000 barrels per day compared 
with 1999, Italy now spends about $55 billion 
ayear on imported oil, up from $12 billion in 
1999. That difference is close to the current 
annual trade deficit. The price of oil is likely 
to have been a large contributor to the euro 
crisis in southern Europe, where countries 
are completely dependent on foreign oil. 

The International Energy Agency has 
made it very clear that the global economy is 
at risk when oil prices are greater than $100 
per barrel — as they have been in recent 
years, and will surely continue to be, given 
the inelastic response of global production. 

Historically, there has been a tight link 
between oil production and global economic 
growth. If oil production can’t grow, the 
implication is that the economy can’t grow 
either. This is such a frightening prospect 


that many have simply avoided considering 
it. The International Monetary Fund, for 
example, still projects economic growth of 
4% of gross domestic product for the next 
five years: near the top of the historical range 
since 1980. Yet to achieve that will require 
either a heroic increase in oil production of 
3% per year, increased efficiency of oil use, 
more energy-efficient growth or rapid sub- 
stitution of other fuel sources. Economists 
and politicians continually debate policies 
that will lead to a return to economic growth. 
But because they have failed to recognize 
that the high price of energy is a central 
problem, they haven't identified the neces- 
sary solution: weaning society off fossil fuel. 

The UK Industry Taskforce on Peak Oil 
and Energy Security and the UK govern- 
ment’s Department of Energy and Climate 
Change are very 


“The price aware of these risks, 
of oilis likely and have made a 
to have commitment to work 
been alarge together to protect the 
contributor to United Kingdom and 
the eurocrisis its economy from ris- 
in southern ing oil prices. The task 
Europe.” force, formed in 2008, 


warned that Britain 
must not be caught out by the oil crunch, 
and said that policies to address ‘peak oil’ 
must be made a priority. In 2011, its chair- 
man, John Miles of architects and design 
engineers Arup in London, said: “We must 
define the risks and develop sensible contin- 
gency plans. This means thinking critically 
about what we should be doing now if we 
knew that the oil price would soar over the 
next five years.’ Such joint industry/federal 
government recognition of the problem does 
not exist in the United States, where action 
has largely been at the state or city level. The 
UK government has embedded by parlia- 
mentary statute a commitment to decrease 
carbon dioxide emissions by 80% by 2050 
compared with 1990 levels. The US Congress 
has rejected any such commitment. 


FASTER ACTION 

Climate change and changes in fossil-fuel 
production are generally seen as separate 
phenomena. But they are closely linked. The 
risk of fossil-fuel supply limitation should be 
included when considering the uncertainties 
of future climate change. The approaches 
needed for tackling the economic impacts 
of resource scarcity and climate change are 
the same: moving away from a dependence 
on fossil-fuel energy sources. Whereas the 
implications of climate change have driven 
only slow policy responses, economic con- 
sequences tend to drive shorter-term action. 
We know from the historical record that 
when there are oil-price spikes, the economy 
begins to respond within a year. Govern- 
ments that fail to plan for the decline in 


fossil-fuel production will be faced with 
potentially major blows to their economies 
even before rising sea levels flood their coasts 
or crops begin to fail catastrophically. 

The solutions are not secret or mysterious. 
Globally we get 55 x 10’* joules of useful 
energy from 475 x 10”* joules of primary 
energy from fossil fuels, biomass and nuclear 
power plants. The difference is due to energy 
losses and inefficiencies in the conversion 
and transmission processes. By increasing 
the efficiency, we could get the same use- 
ful energy by burning less fuel. We need to 
specify conservation goals for improving 
the efficiency of use of fossil-fuel energy. 
These include taxing oil to keep prices high 
and to encourage a reduction in energy use; 
encouraging nuclear energy; questioning 
if and how economic growth can continue 
without an increase in fossil fuels; lower- 
ing speed limits on roads and encouraging 
public transport; or redirecting tax credits 
towards renewable-energy development. 
The transformation will take decades, so we 
must begin as soon as possible. Emphasizing 
the short-term economic imperative from 
oil prices must be enough to push govern- 
ments into action now. 
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Provocative questions 
in cancer research 


Harold Varmus and Ed Harlow explain how an innovative 
initiative is opening up neglected areas of oncology. 


hat is the best way for large 
research organizations to guide 
the expenditure of their funds? 


Governmental funding agencies, major 
philanthropic foundations and industrial 
firms bear a primary allegiance to those 
who pay for them — taxpayers, benefactors 
and shareholders, respectively. They are 
also dedicated to long-term goals — solving 
problems posed by disease, environmental 
change or national defence. 

The issue is hotly debated. At one end of 
the spectrum, organizations place decisions 
about the direction of research on the shoul- 
ders of individual investigators who conceive 
new proposals and the peers who judge their 
value. At the other, they define the research 
programmes and direct them towards their 
goals from the top down. 

As the director of the US National Cancer 
Institute (NCI) and a senior adviser to the 
director, we and others have developed a dif- 
ferent way to manage a significant portion of 
the agency’s research portfolio. Our Provoc- 
ative Questions initiative asks investigators 


to propose intriguing questions in cancer 
research that need attention but would 
usually find it difficult to get. The initia- 
tive does not replace the NCT’s traditional 
reliance on the imaginations of individual 
investigators; nor does it intend to restate 
obvious goals. Instead, it aims to engage a 
diverse range of scientists in a challenging 
intellectual exercise to define then solve the 
major unsolved or neglected problems in 
oncology. 

The initiative builds on a rich history of 
scientific discoveries that have stemmed 
from well conceived questions. Such 
questions recognize the role of natural 
phenomena, reflect technical feasibility 
and convey the importance ofa satisfactory 
answer. They point to alternative directions 
in research, new uses of technologies, for- 
gotten observations or under-studied but 
important problems. They bypass the obvi- 
ous and emphasize the provocative. Properly 
constructed and phrased, they excite curios- 
ity, stimulate ideas and inspire progress. 

Similar strategies in disciplines ranging 
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from mathematics to global health serve 
as precedents. These efforts have tended to 
set goals (the United Nations’ Millennium 
Development Goals') or post challenges 
(David Hilbert’s Grand Challenges in Math- 
ematics” or the Gates Foundation’s Grand 
Challenges in Global Health’). For the NCI, 
however, we chose to pose questions because 
we believe that good questions lie behind 
much of the best work in science. How does 
something work? Where are the key points 
of control? How can we explain unexpected 
observations or apparently contradictory 
findings in mechanistic terms? 

Over the past 18 months, we have 
developed the Provocative Questions initia- 
tive from a concept into a pragmatic strategy 
for supporting new grants. We have hosted 
community-wide activities to gather ques- 
tions; evolved ways to refine and select the 
most powerful ones; and solicited proposals 
to answer them. 

To develop a list of key questions related 
to cancer, the NCI conducted 16 day-long 
workshops on the National Institutes 


ILLUSTRATION BY SERGE BLOCH 


of Health (NIH) campus in Bethesda, 
Maryland, and at several other locations 
throughout the United States. Around 
30 investigators from different scientific 
disciplines and at varying stages of their 
careers posed and discussed questions and 
judged their importance. We then moulded 
the most appealing ones into a format that 
explains the importance of the question, 
the likelihood that it will be answered and 
the ramifications of doing so. The work- 
shops have been revelatory in another, less 
expected way. Minimally structured, they 
were immensely enjoyable, offering hours of 
uninhibited and unconventional exchanges. 

To extend this exercise beyond the small 
number of people who could attend the 
workshops, the NCI has created a website 
(provocativequestions.nci.nih.gov). There, 
anyone can learn about the motivations 
behind the initiative, and people have been 
able to propose additional questions and 
comment on those suggested at workshops 
or online. The site now lists more than 
100 questions and has been visited by more 
than 35,000 scientists throughout the world, 
at a rate of roughly 1,800 hits per day. 

To formalize the initiative, the NCI has 
issued a request for applications and pro- 
visionally set aside US$15 million from the 
budget for fiscal year 2012 to support the 
best ideas for answering any of the ques- 
tions chosen from a list of 24. Applications 
for funding will be judged by special NCI 
study sections, the members of which will 
be asked to assess the relative power of the 
proposed ideas, rather than the support- 
ing preliminary data or the reputation of 
an applicant. 

Four examples of under-studied areas cho- 
sen for funding are outlined below. Detailed 
descriptions can be found at the website. 


FOUR EXAMPLES 

Drug mechanisms: Although conventional 
chemotherapy is often regarded as a sub-opti- 
mal treatment strategy, with high toxicity and 
less than curative outcomes in common adult 
cancers, there are several well documented 
situations (such as germ-line or paediatric 
cancers) in which these therapies can provide 
complete cures*. For example, cisplatin cures 
most patients with advanced testicular cancer 
and has been the standard treatment for dec- 
ades. Perhaps because it has been so success- 
ful, few studies of this drug response are being 
conducted. Yet knowing why this drug and 
others can produce such durable cures could 
illuminate the search for similar opportunities 
for cancers that are less successfully treated. 


Cancer evolution: The emergence of 
tumours resistant to both traditional chem- 
otherapies and newer targeted treatments 
involves selection of cancer-cell variants 
that have acquired properties that block 


or sidestep the action of the drug’. As an 
evolutionary biologist pointed out dur- 
ing one workshop, little attention has been 
directed to the application of ideas about how 
Darwinian selection by massive cell killing 
might drive the emergence of drug-resistant 
tumour cells. Applying sucha strong selective 
pressure may not be the best way to under- 
mine the behaviour of cancer cells. 


Obesity risk: Epidemiologists have estab- 
lished that obesity dramatically increases the 
risk of developing some cancers, and in some 
cases the risk rivals that for tobacco®*. One 
of the Provocative Questions asks scientists 
to consider how obesity promotes tumour 
development. Uncovering the molecular 
mechanisms by which this happens could 
help to identify new approaches to cancer 
prevention or treatment. Such an effort 
would be a powerful example of how to link 
risk identification to the molecular steps that 
drive carcinogenesis. 


Ageing and cancer: Cancers are commonly 
considered to be associated with ageing, 
presumably because of the accumulation 
of mutations affecting cancer genes. How- 
ever, some cancers peak in childhood, 
adolescence, young adulthood or middle age. 
Moreover, the occurrence of cancers does not 
correlate well with the lifespan of an animal: 
mice, which have a two-year life expectancy, 
are much more prone to cancer than are tur- 
tles or marine mammals that live longer than 
human beings. What accounts for these dis- 
crepancies? Does our understanding of the 
process of ageing help to explain the patterns 
of carcinogenesis as a function of age? 


MEASURING SUCCESS 
Because the Provocative Questions initiative 
is an experiment, the NCI is using a combina- 
tion of subjective and objective assessments 
to measure its success. The workshops were 
welcomed enthusiastically: almost all invita- 
tions to participate were accepted, and we 
received many testimonials afterwards. The 
response to the request for applications has 
also been strong — perhaps not surprising at 
a time when success rates for new grants are 
at an all-time low across the NIH. We have 
processed more than 750 applications, with 
every question receiving interest from multi- 
ple investigators. The response suggests that 
the initiative is meeting a pent-up demand 
for more imaginative funding opportunities. 
Of course, the proof of principle — finding 
satisfying answers to a significant number 
of Provocative Questions or developing 
sustained research programmes to expand 
the pursuit of answers — will take years to 
accomplish. In the meantime, we plan to 
expand the list of questions through addi- 
tional workshops and the website, to invite 
requests for applications annually for at least 


another two years and to consider whether 
to expand the initiative in the more distant 
future. 

We believe that the Provocative Questions 
model will help funding agencies in other 
fields. The approach helps to define the 
boundaries between the known and 
unknown, it takes advantage of new develop- 
ments to refocus a discipline’s attention on 
historically intractable problems; and it high- 
lights perplexing issues that may be raised by 
new evidence about traditional topics. 

An initiative of this type combats the 
pressures for ‘safe havens, which are com- 
mon in eras of budgetary stringency. Reduc- 
tions in funding tend to prompt all parts of 
the research community to become more 
conservative. Grant writers look to the saf- 
est harbours for research ideas and often 
converge on similar subjects, narrow- 
ing research portfolios. In turn, reviewers 
respond conservatively; popular ideas are 
likely to score well, again narrowing the 
scope of approaches. 

By pooling the imaginations of the 
research community to address understud- 
ied areas, an initiative such as ours provides a 
useful compromise between two traditional 
approaches of funding agencies: one that 
focuses on goals set by portfolio managers 
and another that is entirely unstructured. 

The benefits will also be evident in fields 
that are progressing rapidly. In our research 
area of oncology, we are witnessing an explo- 
sion of knowledge about the genetic basis and 
underlying biology of many cancers, matched 
by the development of powerful technologies 
for expanding and applying that knowledge 
to the classification and management of dis- 
ease’. Under these circumstances itis easy to 
become overconfident in the progress that is 
occurring. Provocative Questions are wel- 
come antidotes, as well as productive guides 
to future research. m 


Harold Varmus is the director of the US 
National Cancer Institute and Ed Harlow is 
a senior adviser to the director. 

e-mail: harold. varmus@nih. gov 
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Joseph Rotblat shared the Nobel Peace Prize for his efforts to rid the world of nuclear weapons. 


NUCLEAR PHYSICS 


Arms andthe man 


Martin Rees applauds a biography of the physicist who 
kickstarted the Pugwash Conferences for arms control. 


he physicists who developed nuclear 
[oon during the Second World 

War mostly returned to academic 
pursuits when peace was declared. But some 
did what they could, however little, to control 
the powers that they had helped to unleash. 
The most committed and idealistic of these 
was Joseph Rotblat — the instigator and 
driving force behind the Pugwash Confer- 
ences on Science and World Affairs, which 
aim to reduce the danger of armed conflict 
through global cooperation. 

Radiation oncologist Andrew Browns fine 
biography, Keeper of the Nuclear Conscience, 
chronicles a life spanning almost the entire 
twentieth century and moulded by its tur- 
moil and horrors. Rotblat was born in Poland 
in 1908, and his family suffered severe priva- 
tions in the First World War. But he was ener- 
getic and persistent — a lifelong trait — and, 
by the age of 30, had achieved international 
standing in radioactivity research. 

In 1939 he left the Warsaw Radiological 
Laboratory in Poland to take up a short-term 


post with physicist 
James Chadwick at 
the University of Liv- 
erpool, UK. This was 
a fortunate move: 
the Warsaw lab was 


destroyed a year later, 
and few ofits staff sur- 
vived the Nazi occupa- 
Keeper of tion. Rotblat quickly 
the Nuclear became Chadwick’s 
pple raat trusted lieutenant. 
Joseph Rotblat Chadwick was part of 
ANDREW BROWN a team studying the 
Oxford University feasibility of a fission 


Press: 2012. 368 pp. 
£18.99, $29.95 


weapon under the 
code names Maud and 
Tube Alloys. When 
this UK effort was subsumed into the US 
Manhattan Project in 1943, Chadwick used 
his influence to enable Rotblat to move to 
the Los Alamos Laboratory in New Mexico, 
despite his Polish citizenship. 

Since 1940, nuclear physicists had feared 
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a nightmare scenario in which Adolf Hitler 
developed atomic weapons. For Rotblat, 
this was the only moral justification for the 
Allies’ bomb project. He was, famously, the 
only scientist to leave the Manhattan Project 
when, by 1944, this threat no longer seemed 
realistic. He recounted that the trigger was 
a comment by Leslie Groves, the military 
head of the project, that the bomb could be 
used against the Soviet Union. But Brown 
elaborates the complex motives and per- 
sonal turmoil behind this decision. Rotblat 
had heard nothing from his wife since 1939, 
and learnt only later that she had perished in 
a concentration camp. And he was mindful 
that US authorities were concerned that his 
secret knowledge might reach Poland and 
fall into the hands of the Nazis or Soviets. 

Rotblat returned to Liverpool, helping to 
rebuild the physics department, but moved 
in 1950 to a position in radiation medicine 
attached to St Bartholomew’s Hospital in 
London. He first gained public prominence 
in 1954. By analysing radioactive dust from 
a Japanese fishing boat that strayed too close 
to a US thermonuclear test, he inferred 
key features of the bomb’s design. He also 
appeared, with British philosopher Bertrand 
Russell, in an early Panorama programme 
on BBC television, highlighting the hazards 
of nuclear fallout. Russell subsequently pre- 
pared a manifesto against nuclear weapons, 
signed by Albert Einstein and other eminent 
scientists, which declared that the signato- 
ries were speaking “not as members of this 
or that nation, continent, or creed, but as 
human beings, members of the species Man, 
whose continued existence is in doubt”. 

Bolstered by this initiative, in 1957 Rotblat 
arranged for a select group of Eastern and 
Western scientists to meet in the Canadian 
village of Pugwash, Nova Scotia, for private 
discussions of how to control nuclear arms. 
Thus began the Pugwash conferences and 
workshops. There have now been around 
300; Rotblat attended almost all of them 
before his death in 2005. Funding was a 
continual challenge, especially because it was 
crucial to avoid sponsorship from propagan- 
dist bodies. 


BAN BROKER 

How much influence did these non-gov- 
ernmental gatherings have? During the 
1960s, the Pugwash Conferences undoubt- 
edly offered crucial back-channel contact 
between the United States and the Soviet 
Union. This eased the path for the Partial 
Test Ban Treaty of 1963, and the subse- 
quent anti-ballistic missile and non-prolif- 
eration treaties. The Pugwash agenda later 
broadened to include 


biological weapons ODNATURE.COM 
and problems of the  Formore onuoseph 
developing world.  RotblatandPugwash: 
Brown argues thatthe — go.nafure.com/|jGnn2 
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conferences thereby became less distinctive, 
and that their influence was diluted. 

Brown also notes the tension between 
those who wished to promote piecemeal 
progress and those, such as Rotblat, who did 
not wish to lose sight of the long-term aim to 
rid the world of nuclear weapons. This latter 
view was widely derided as woolly idealism. 
But it gradually gained broader establish- 
ment support. In his later years, for instance, 
Robert McNamara, defence secretary to US 
presidents John EF. Kennedy and Lyndon 
Johnson, attended several Pugwash meet- 
ings. This might have seemed incongruous, 
as did Rotblat’s friendship with Soviet Union 
President Mikhail Gorbachev. But these men 
came together with the realization that elim- 
inating nuclear weapons should be an even- 
tual goal. This view has become mainstream, 
espoused recently by the US ‘gang of four’ 
(former secretaries of state Henry Kissinger 
and George Schultz, former defence secre- 
tary William Perry and former Senator Sam 
Nunn) and proclaimed in President Barack 
Obama's 2009 speech in Prague. 

When the Pugwash Conferences were 
recognized by the 1995 Nobel Peace Prize, 
half of the award went to the organization 
and half to Rotblat personally. It was charac- 
teristic that he donated his half of the prize 
money to the organization. 

That was also the year of Rotblat’s absurdly 
overdue election to the Royal Society — 
recognition that meanta lot to him. Ina mem- 
orable speech, he warned against exchanging 
the ivory tower for secret work. He quoted 
Solly Zuckerman, long-term adviser on 
nuclear policy to the UK government: “When 
it comes to nuclear weapons ... it is he, the 
technician, not the commander in the field, 
who is at the heart of the arms race”. 

Until his last few months (he died aged 96), 
Rotblat travelled the world with the resilience 
ofa man half his age. He was concerned about 
the hazards that could stem from the misuse 
of twenty-first-century science, so was keen 
to convey his disquiet beyond the Pugwash 
community, and to a younger generation. 
He favoured a ‘Hippocratic oath’ whereby 
scientists would pledge to use their talents for 
human benefit. Whether or not such an oath 
would have substance, there can be no doubt 
of his success at raising awareness. Even in his 
nineties, he could still captivate students. 

Brown's balanced and comprehensive 
biography is welcome. Rotblat’s inspiring life 
— against a backdrop of tragedy and hard- 
ship, with idealism but without illusions — 
deserves to be better known. = 


Martin Rees is master of Trinity College, 
Cambridge, UK; former president of the 
Royal Society; and author of From Here 
to Infinity: Scientific Horizons. He has 
attended several Pugwash meetings. 
e-mail: mjr36@cam.ac.uk 


Books in brief 


A User’s Guide to Thought and Meaning 

Ray Jackendoff OXFORD UNIVERSITY PRESS 288 pp. £16.99 (2012) 
Linguist Ray Jackendoff wends his way through thorny questions 
around language and thought, hedge-clippers in hand. Covering 
words and meaning, consciousness and perception, reference, 
rationality and intuition, he reveals the split between how we 
experience the world and how perception and language are 
organized. His conclusion is that rational thought rests on 
intuition — and that meaning and thought are almost completely 
SJ unconscious. The book is a multi-perspective journey into the 
hinterlands of the ‘problem of knowledge’. 


f — The Science of Yoga: The Risks and the Rewards 

| ; William J. Broad SIMON & SCHUSTER 336 pp. £16.63 (2012) 

| THe ee Whether it is hatha, ashtanga or kundalini, yoga is swathed in hype. 

pes, | This ancient discipline is a vast and growing industry in the West, 

Y | and ripe for investigation. Science writer William Broad, a two- 

| time Pulitzer prizewinner and experienced yoga practitioner, has 

| spent five years researching the teaching and practice. The result 
is illuminating and often unnerving. Yoga has proven physical and 
psychological benefits, yet Broad shows how even ordinary poses, 

lg poorly taught to the unfit, can lead to strokes, serious hip-joint 
damage and worse. 


Steam-Powered Knowledge: William Chambers and the Business 
of Publishing, 1820-1860 

Aileen Fyfe UNIVERSITY OF CHICAGO PRESS 336 pp. $50 (2012) 

As e-books rise, it is salutary to recall another technological 
revolution in bringing words to mind: the steam-powered printing 
press. Historian Aileen Fyfe focuses on Edinburgh publishing firm 
W. & R. Chambers to reveal how the cheap print that flooded both 
sides of the Atlantic in the 1800s transformed the dissemination of 
knowledge. This chronicle of great endeavour is studded with small 
pleasures, from the horror over ‘sordid’ railway literature to engineer 
Charles Babbage’s awe of printing speeds at The Times. 


Across Atlantic Ice: The Origin of America’s Clovis Culture 
Dennis J. Stanford and Bruce A. Bradley UNIVERSITY OF CALIFORNIA 
PRESS 336 pp. $34.95 (2012) 

North America’s first peoples were long thought to be Asians 
who migrated over the Bering land bridge some 12,000 years 
ago, bringing with them the tools of the Clovis culture. Now 
archaeologists Dennis Stanford and Bruce Bradley have radically 
recast the story. Drawing on climatic, genetic and archaeological 
evidence, they argue that the roots of Clovis culture rest in the 
Solutrean people of Spain and France, who sent some of their 
number across the Atlantic in boats 18,000 years ago. 


The Puzzler’s Dilemma: From the Lighthouse of Alexandria 

to Monty Hall, a Fresh Look at Classic Conundrums of Logic, 
Mathematics, and Life 

Derrick Niederman DUCKWORTH 216 pp. £14.99 (2012) 
Mathematician Derrick Niederman takes classic logic puzzles and 
weaves them into a fiendish brain-teaser of a narrative. He covers 
‘kangaroo’ puzzles that contain their own answers, lateral-thinking 
posers, and even riddles that abide by Sherlock Holmes’s great 
technique — eliminating the impossible so that “whatever remains, 
however improbable, must be the truth”. 
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The cooling Universe enabled galaxies to form, as this simulation shows. 


COSMOLOGY 


Plucked from 
the vacuum 


A tale of multiverses, cosmic inflation and dark energy 


grips Caleb Scharf. 


the predictions of fundamental physics 

converge with the observation of nature 
on cosmic scales. The result is an increas- 
ingly convincing picture that we, and the 
Universe as a whole, owe our existence to 
the instability of ‘empty’ space at a quantum 
level. Moreover, space and the laws of phys- 
ics may be just one variant among many pos- 
sible ways to assemble universes, separated 
by dimension or by scale. 

How physicists came up with the current 
model of the cosmos is quite a story, and to 
tell it in his elegant A Universe From Nothing, 
physicist Lawrence Krauss walks a carefully 
laid path. The evolving Universe of finite age 
has pushed physicists to tackle ever more pro- 
found and challenging questions using some 
of the most innovative tools ever forged. 

Krauss begins in 1916, when Albert Ein- 
stein fudged the field equations in his general 
theory of relativity to ensure that they mod- 
elled a static Universe. To do this he addeda 
small constant repulsive force, equivalent to 
endowing empty space with energy. Within 
a few years, Edwin Hubble and other astron- 
omers demonstrated that, in fact, the Uni- 
verse is expanding and dynamic. Einstein 
removed the force from his equations. In the 
following decades came the recognition that 
most matter is dark and only weakly inter- 
acting, and that the observed cosmic mix of 
light elements, such as hydrogen, helium and 


I: less than 100 years, humanity has seen 


lithium, is the result of 
primordial nucleosyn- 
thesis in the first min- 
utes after the Big Bang, 
when newly stable 
protons and neutrons 
combined in a frenzy 
of nuclear fusion. 
Another milestone 


in the story is the AUniverse 
measurement, in 1992, "0m Nothing: 

of hot and cold patches why Tiere Ie 

‘ath fmi Something Rather 
in th eseaofmicrowave Than Nothing 
radiation that was scat- | AWRENCE M. KRAUSS 
tered asthe plasmaofa Free Press: 2012. 


224 pp. $24.99, 
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verse cooled enough 
to form atoms. These 
patterns reveal the seeds of galactic structures 
in today’s Universe and offer striking confir- 
mation of current cosmological theories. 
But just as all seemed to be fitting together 
well, astronomers turning their telescopes 
to distant, dimmed supernovae saw that the 
expansion of the space between the stars was 
accelerating rather than decelerating. Like 
Einstein, physicists are again stepping back 
to look for a way to tie the strands together. 
Mathematical investigations of the atomic 
and subatomic world provide a plausible 
solution. The overall smoothness and geo- 
metrical regularity of the Universe could 
have resulted from its rapid inflation from 
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quantum scales during its earliest stages. 
The later accelerating expansion might be 
driven by the ‘dark energy’ of the vacuum 
itself — a seething ocean of virtual particle 
pairs popping in and out of existence as a 
result of quantum uncertainty. 

Furthermore, Krauss points out, our Uni- 
verse seems to have a net gravitational energy 
that is suspiciously close to zero: its exist- 
ence may ‘cost’ nothing, requiring no energy 
input. This raises the possibility of the ulti- 
mate free lunch — of acosmos that is merely 
a piece of borrowed stuff, having appeared 
spontaneously, like a virtual particle, and 
been filled with matter and radiation sim- 
ply as a consequence of the energy of empty 
space. Ours may be one of an infinite array 
of universe-like things, just one instance in 
a multiverse. 

It would be easy for this remarkable story to 
revel in self-congratulation, but Krauss steers 
it soberly and with grace, taking time to let the 
reader digest the material. His discussion of 
the multiverse is a good example: he lays out 
the possibilities and scientific and philosophi- 
cal implications without beating the drum 
for any one hypothesis. His asides on how he 
views each piece of science and its chances of 
being right are refreshingly honest. 

He notes that a number of vital empirical 
discoveries are, ominously, missing from our 
cosmic model. Dark matter is one. Despite 
decades of astrophysical evidence for its 
presence, and plausible options for its origins, 
physicists still cannot say much about it. We 
don't know what this major mass component 
of the Universe is, which is a bit of a predica- 
ment. We even have difficulty accounting 
for every speck of normal matter in our local 
Universe. This does not mean that some- 
thing is wrong with the current picture, but 
that we astronomers should be uncomfort- 
able about embracing a phenomenon such as 
dark energy when we still have a mess to tidy 
up elsewhere. 

Krauss eases such concerns by showing 
that the Universe's accelerating expansion 
fits astonishingly well with a host of its other 
characteristics. He shows how the theories of 
the physical sciences — from the subatomic 
to the cosmological — unite sublimely. 

What does this mean for humanity? In a 
provocative afterword, evolutionary biologist 
and atheist Richard Dawkins writes that the 
apparent near-inevitability of something aris- 
ing out of unstable nothingness, as described 
by Krauss, is devastating for theologians and 
creationists. Dawkins is right. But it is also 
invigorating for the rest of us, because in this 
nothingness there are many wonderful things 
to see and understand. = 


Caleb Scharf is director of astrobiology at 
Columbia University, New York, USA. His 
forthcoming book is Gravity’s Engines. 
e-mail: caleb@astro.columbia.edu 
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Science gets real 


at Sundance 


This year’s crop of research-related films moves away 
from ‘mad genius’ mode, finds Jascha Hoffman. 


ollywood likes its science 
H catastrophic and its researchers 

portrayed as either reckless geniuses 
or crazed loners. Will we ever have a real- 
istic rendering of the benign innovators at 
work in a lab near you? The answer could 
be a qualified yes, judging from the lineup 
at this year’s Sundance Film Festival in Utah. 

The cinematic celebration, now in its 34th 
year, promotes US film outside Hollywood 
and has launched the career of directors such 
as Steven Soderbergh, whose epidemiologi- 
cal thriller Contagion did well last year. For 
2012, the festival has embraced the bright 
side of science in its documentary-heavy 
programme. 

“Most of these films are positive in their 
view of science as a means to improve our 
lives,” said John Nein, a film programmer 
at the Sundance Institute in Los Angeles, 
California. Many of the festival's 100 or so 
features bear out Nein’s words. 

Among a handful of documentaries con- 
cerned with climate, energy and the environ- 
ment, one stands out 
for its sheer ambition. 
Chasing Ice, directed 
by Jeff Orlowski, 
follows dogged 


> NATURE.COM 
For Isabella 
Rosselini’s take on 
science and film: 
go.nature.com/urek25 


National Geographic 
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Sundance Film 
Festival 

Park City, Utah. 
Until 29 January. 


photographer James 
Balog as he deploys 
30 cameras on three 
continents to gather 
time-lapse footage of glaciers melting and 
ice shelves collapsing. The result is a graphic 
portrait of global warming. Director Mark 
Kitchell’s A Fierce Green Fire is a sweeping 
history of environmental activism in the 
past half-century, with biologist E. O. Wil- 
son as adviser. The Atomic States of America, 
directed by Don Argott and Sheena Joyce, 
sounds a warning of the health and environ- 
mental costs of nuclear energy. 

Also on the factual front is a set of docu- 
mentaries showing how science improves 
public life through education, medicine and 
criminal justice. In The D Word: Understand- 
ing Dyslexia (directed by James Redford, son 
of Sundance president and founder Robert 
Redford), physicians from the Yale Center for 
Dyslexia and Creativity in New Haven, Con- 
necticut, explain the neurological underpin- 
nings of dyslexia, revealing how some of its 
features may account for the success of entre- 
preneur Richard Branson and others with the 
condition. 

David France’s How to Survive a Plague 
explores the controversial efforts of AIDS 
activists in the 1980s and 1990s to push 


Time-lapse footage of melting glaciers in Chasing Ice offers a graphic glimpse of global warming in action. 
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experimental treatments through the drug 
pipeline. And West of Memphis, directed by 
Amy Berg and co-produced by The Hobbit 
director Peter Jackson, delves into the finer 
points of forensic science. It tells the story 
of three Arkansas teenagers who, convicted 
of murdering three boys, were released two 
decades later on the basis of DNA evidence. 

Other offerings are more personal and 
research-focused. Homing in on freshwater 
biology, Chris Peters’s The Diatom is a medi- 
tative glimpse of a microbiologist tracking 
the unicellular organisms that thrive in 
Utah’s remote mountain reservoirs. Valley 
of Saints, directed by Musa Syeed, follows a 
biologist and the boatman she falls in love 
with as they discover serious pollution ina 
pristine lake in Kashmir. 

Two star-studded films offer comic relief. 
Director Jake Schreier’s dramatic comedy 
Robot and Frank recounts the budding 
friendship between an elderly thief and 
the household robot that cooks and cleans 
for him, and claims to illuminate “the 
implications of humankind’s ever-changing 
relationship with technology”. Red Lights, by 
director Rodrigo Cortés, features a research 
duo played by Sigourney Weaver and Cil- 
lian Murphy as a pair of sceptics who debunk 
clairvoyants. Until, that is, they discover a 
blind psychic, unaccountably played by Rob- 
ert De Niro, whose apparent powers threaten 
their faith in reason. 

Festival-goers may be lured from their 
seats for conversations on the future of film- 
making. A talk that questions ‘the new future 
normal’ speculates on tomorrow’s beneficial 
technologies, from camera contact lenses to 
regularly upgraded ‘hyperhumanity. A panel 
on good science in films, sponsored by the 
Alfred P. Sloan Foundation, will ask what 
would happen if robots or viruses helped 
humanity instead of wiping it out. Panellists 
include biologists Helen Fisher and Robert 
Full, and Contagion screenwriter Scott Burns. 

Science fiction and allegorical drama also 
get a look-in. Beasts of the Southern Wild, 
directed by Benh Zeitlin, is a moody film 
about a six-year-old girl in a flooded world 
rife with prehistoric creatures. Director 
Colin Trevorrow’s Safety Not Guaranteed is 
a romantic comedy hinging on time travel. 
There are even classic blood-spattered for- 
ays. In Jon Wright’s monster feature Grabbers, 
“giant, bloodsucking, tentacled aliens” rise 
from the sea to prey on boozy Irish villagers, 
despite the efforts of a marine biologist and 
two local police officers. 

There’s always room for a little bone- 
crunching horror — if only for contrast to 
the nuanced, realistic cinematic science we 
see dawning at Sundance. m 


Jascha Hoffman is a journalist based in 
San Francisco, California. 
e-mail: jascha@jaschahoffman.com 
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Correspondence 


Pause on avian flu 
transmission studies 


The continuous threat of an 
influenza pandemic represents 
one of the biggest challenges 

in public health. Influenza 
pandemics are known to be 
caused by viruses that evolve 
from animal reservoirs, such as 
birds and pigs, and can acquire 
genetic changes that increase 
their ability to transmit in 
humans. Pandemic preparedness 
plans have been implemented 
worldwide to mitigate the 
impact of influenza pandemics. 
A major obstacle in preventing 
influenza pandemics is that little 
is known regarding what makes 
an influenza virus transmissible 
in humans. As a consequence, 
the potential pandemic risk 
associated with the many 
different influenza viruses of 
animals cannot be assessed with 
any certainty. 

Recent research breakthroughs 
identified specific determinants 
of transmission of HSN1 
influenza viruses in ferrets. 
Responsible research on 
influenza virus transmission 
using different animal models 
is conducted by multiple 
laboratories in the world using 
the highest international 
standards of biosafety and 
biosecurity practices that 
effectively prevent the release of 
transmissible viruses from the 
laboratory. These standards are 
regulated and monitored closely 
by the relevant authorities. This 
statement is being made by the 
principal investigators of these 
laboratories. 

In two independent studies 
conducted in two leading 
influenza laboratories at the 
University of Wisconsin- 
Madison and Erasmus MC in 
Rotterdam, the Netherlands, 
investigators have proved 
that viruses possessing a 
haemagglutinin (HA) protein 
from highly pathogenic avian 
H5N1 influenza viruses can 
become transmissible in ferrets. 
This is critical information that 
advances our understanding 


of influenza transmission. 
However, more research is 
needed to determine how 
influenza viruses in nature 
become human pandemic 
threats, so that they can be 
contained before they acquire 
the ability to transmit from 
human to human, or so that 
appropriate countermeasures 
can be deployed if adaptation to 
humans occurs. 

Despite the positive public- 
health benefits these studies 
sought to provide, a perceived 
fear that the ferret-transmissible 
H5 HA viruses may escape 
from the laboratories has 
generated intense public debate 
in the media on the benefits and 
potential harm of this type of 
research. We would like to assure 
the public that these experiments 
have been conducted with 
appropriate regulatory oversight 
in secure containment facilities 
by highly trained and responsible 
personnel to minimize any risk 
of accidental release. Whether 
the ferret-adapted influenza 
viruses have the ability to 
transmit from human to human 
cannot be tested. 

We recognize that we 
and the rest of the scientific 
community need to clearly 
explain the benefits of this 
important research and the 
measures taken to minimize 
its possible risks. We propose 
to do so in an international 
forum in which the scientific 
community comes together to 
discuss and debate these issues. 
We realize that organizations 
and governments around the 
world need time to find the best 
solutions for opportunities and 
challenges that stem from the 
work. To provide time for these 
discussions, we have agreed on 
a voluntary pause of 60 days on 
any research involving highly 
pathogenic avian influenza 
H5NI viruses leading to the 
generation of viruses that are 
more transmissible in mammals. 
In addition, no experiments with 
live H5SN1 or H5 HA reassortant 
viruses already shown to be 
transmissible in ferrets will be 


conducted during this time. 

We will continue to assess 

the transmissibility of HSN1 
influenza viruses that emerge 

in nature and pose a continuing 
threat to human health. 

Ron A. M. Fouchier Erasmus 
MC, Rotterdam, the Netherlands. 
Adolfo Garcia-Sastre Mount 
Sinai School of Medicine, 

New York, USA. 

Yoshihiro Kawaoka University 
of Tokyo, Japan; and University of 
Wisconsin-Madison, Wisconsin, 
USA. 
kawaokay@svm.vetmed. wisc.edu 
For a full list of co-authors, see 
go.nature.com/mwqvoo 


Train local experts to 
help conserve forests 


The management of tropical 
forests by indigenous 
communities requires a formal 
intervention plan based on 
quantitative inventories, 
growth studies and permanent 
monitoring systems. But in our 
30 years of experience, few such 
communities, if any, would be 
able to produce a plan on their 
own. This shortcoming needs to 
be rectified. 

One answer would be to train 
village-based resource managers, 
who could use their knowledge 
and skills in forestry, ecology, 
ethnobotany, economics and 
anthropology to help local 
communities manage forests 
and interact with the state, 
commercial markets and the 
global conservation community. 
This would reduce dependency 
on external inputs, draw on 
local skills, promote community 
organization and self-governance, 
and help to implement ways 
of reducing emissions from 
deforestation and degradation. 

Crucially, community-owned 
forests would then stay in the 
hands of those who have lived 
in, used and looked after them 
for so long. 

Charles M. Peters The New York 
Botanical Garden, New York, USA. 
cpeters@nybg.org 
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Biobank donors 
should have a say 


Our findings from a qualitative 
survey of patients, health-care 
providers, researchers and 
other members of the public in 
Switzerland support suggestions 
that the views of prospective 
tissue donors should be taken 
into account in securing 
biobank samples (Nature 478, 
312-313; 2011). 

In 2009-10, we interviewed 
90 people in focus groups, using 
fictitious case studies to explore 
perceptions about the use of 
human tissue. 

Those professionals who 
preferred blanket consent from 
donors commonly expressed 
fears that providing information 
on tissue use to patients could 
make them reluctant to donate 
samples. The general population 
and patient groups, by contrast, 
usually wanted to be informed 
and have the opportunity to give 
consent. 

Among patients, the need to 
be informed was linked to a wish 
to be appraised of their medical 
condition and to be a partner 
in the therapeutic process. 
Cancer patients were particularly 
enthusiastic about the use of their 
tissues because it helped them to 
make sense of their illness and 
could potentially benefit others. 
Brenda Spencer, Daria 
Koutaissoff, Hans-Anton Lehr 
Lausanne University Hospital, 
Switzerland. 
brenda.spencer@chuv.ch 
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OBITUARY 


James Crow 


(1916-2012) 


Population geneticist who studied mutation, selection and random drift. 


uch about James Franklin Crow, 
Me died on 4 January two weeks 

short of his 96th birthday, chal- 
lenges our sense of scale. Over seven dec- 
ades, he contributed to an astonishing array 
of topics in genetics, and the list of his stu- 
dents and postdocs reads like a who's who. 
One of them, the pioneering geneticist 
Motoo Kimura, wrote that getting Crow as 
his adviser after a period of uncertainty was 
such a joy it was like “meeting Buddha in 
Hell”. Crow also played the viola for 45 years 
with the Madison Symphony Orchestra. He 
once performed with the great 
violin soloist Yehudi Menuhin. 

Crow is primarily known for 
his work in theoretical popu- 
lation genetics. He developed 
a concept of genetic load — a 
measure of how fitness may 
be reduced by selection — and 
applied it widely, in particular to 
calculating how quickly natural 
selection would remove delete- 
rious mutations from a popula- 
tion. With Kimura in 1965, he 
quantified how sex accelerates 
the accrual of beneficial muta- 
tions in an evolving popula- 
tion. He refined the concept of 
effective population size and 
provided simple connections 
between selection at the genetic 
level (genotype) and at the physi- 
cally observed level (phenotype). 

Crow’s expertise drew him 
into the public-health sphere. He 
served ona National Academy of 
Sciences committee to assess genetic damage 
caused by radiation in victims of the atomic 
weapons dropped on Hiroshima and Naga- 
saki. And he advised the US government in 
the 1980s and 1990s on forensic DNA tests, 
leading to their widespread adoption in the 
courts. 

Born in 1916, in Phoenixville, Pennsylva- 
nia, Crow grew up in Wichita, Kansas, where 
his father taught biology at Friends Univer- 
sity. Crow took a bachelor’s degree there and 
moved to the University of Texas at Austin for 
his PhD in genetics, which he earned in 1941. 
After several years at Dartmouth College in 
Hanover, New Hampshire, he moved to the 
University of Wisconsin-Madison in 1948, 
where he remained for the rest of his career. 

He was interested in the pursuit of truth, 
and never advocated a particular opinion. 


This must be one reason why Crow and 
Kimura's 1970 textbook An Introduction to 
Population Genetics Theory has hada lasting 
role in the development of the field. 

Most of Crow’s experimental work was 
concerned with deleterious mutations, both 
de novo and segregating within populations. 
In addition, his lab made two landmark dis- 
coveries in Drosophila genetics: a locus called 
segregation distorter, which can manipulate 
Mendelian genetic segregation in meiosis, 
found by Yuichiro Hiraizumi and Larry San- 
dler; and P elements, which can cause high 


rates of mutation, found by William Engels 
(and independently by Margaret Kidwell, 
then at Brown University in Providence, 
Rhode Island). P elements are now widely 
used for genetic modification of fruitflies. 


CLEAR JUSTIFICATION 

Characteristically, Crow declined to be a 
co-author of papers reporting these two 
discoveries: he did not feel that providing a 
good working environment and sage advice 
was sufficient to warrant it. It was often hard 
to convince Crow to put his name ona paper 
even when this was clearly justified. 

Crow was very concerned about the 
negative impact on human health of mildly 
deleterious alleles that, due to advances in 
medicine, can accumulate almost unchecked 
by natural selection. Individually, these may 
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do little — causing a slight increase in blood 
pressure, for instance. But cumulatively their 
effect could prove fatal. Unlike some other 
prominent geneticists, Crow was always 
careful to separate science and policy and 
never advocated simplistic approaches to 
dealing with this problem. 

After his retirement in 1986, Crow largely 
devoted himself to writing about the early 
history of genetics and evolutionary biol- 
ogy. He was well placed, having had close 
long-term acquaintance with many pioneers 
— including Sewall Wright, Ronald Fisher, 
J. B. S. Haldane and Hermann Mul- 
ler. His command of the subject 
and fondness for it, his benevolence 
and fairness, attention to detail and 
beautiful prose mean that Crow’s 
publications remain required read- 
ing for anybody interested in this 
crucial phase. 

Crow was famous for his kind- 
ness, which coexisted with the 
firmness of his views. There was no 
better person to point out a grave 
error in your work — making him 
always in demand and appreciated 
by his undergraduate students. My 
dentist in Madison once completely 
forgot about my filling, when he 
realized that I worked in Crow’s 
laboratory. He still spoke of “Dr 
Crow” with awe some 25 years after 
taking one of Crow’s courses during 
pre-medical training. 

Those who knew Crow well will 
thoroughly miss his low-key benign 
humour, often directed towards 
himself. When asked about his religious 
views, he always cheerfully replied that the 
Bible is “important if true”. Still, he cher- 
ished the ethics of his Quaker heritage. Crow 
also explained to an insistent advocate of a 
healthy lifestyle that exercise prolongs life 
only by as much time as one spends exercis- 
ing, so he did not see a point in it. He always 
remained active — and died in his sleep. 

To borrow Crow’s phrase, originally about 
Wright: “They don’t make them like that any 
longer”. m 
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Spikes timed through inhibition 


Purkinje cells in the brain region known as the cerebellum act by inhibiting their target neurons. A paper in this issue 
provides an explanation for how this inhibition might be used to control the timing of action potentials. But experts are not 


equally convinced about the functional relevance of this finding. SEE LETTER P.502 


THE PAPER IN BRIEF 

@ Purkinje cells regulate the entire 

output of the cerebellar cortex — a brain 
area responsible for generating precise 
movement. 

@ They do so by sending inhibitory signals to 
nuclei (groups of neurons) lying deep within 
the cerebellum. 

@ So, while Purkinje cells are firing, the 
downstream nuclear neurons should be 
inhibited. 

@ However, the firing rate of Purkinje 

cells and nuclear neurons do not 


Time for action 


JAVIER F. MEDINA 


Ppa” and Raman’ identify the conditions 
that allow Purkinje cells to control the 
timing of spikes (action potentials) in their 
target cerebellar nuclear neurons with milli- 
second precision. This is very exciting news for 
several reasons. 

From a purely computational standpoint, 
the paper reveals a time-locking mechanism 
(Fig. 1) for neurons receiving inhibitory 
input from groups of cells that fire action 
potentials at high and irregular rates. This 
mechanism, which may be applicable to 
other inhibitory circuits in the brain, stands 
in stark contrast to the precise regulation of 
spike timing in another brain region — the 
cerebral cortex — by synchronous inputs that 
depolarize the neuronal membrane. It relies 
on several idiosyncratic features of cere- 
bellar nuclear neurons that were previously 
unknown, including a high intrinsic firing 
rate, ultra-fast inhibitory synapses (neuronal 
connections) and, as also suggested by others’, 
some degree of synchronous activity in the 
Purkinje-cell input. 

The findings are equally intriguing for 
our understanding of how the brain controls 
movement, especially given that a time-lock- 
ing mechanism seems perfectly suited to con- 
tribute to one of the most notable functions 


always vary inversely. 

@ Person and Raman! (page 502) propose 
that the answer to this conundrum lies in 
synchrony of firing. 

@ Through in vitro experiments, they 

find that, as expected, random firing of 
Purkinje neurons powertully inhibits 
nuclear cells. 

@ But when a small number of Purkinje cells 
fire synchronously, nuclear neurons fire 
action potentials that are ‘time-locked’ 

to the gaps between the inhibitory Purkinje 
inputs (Fig. 1). 


ascribed to the cerebellum — coordinating 
muscles with utmost accuracy and temporal 
precision. But does the cerebellum really use 
time-locking to control movement? After all, 
years of research have shown that much of 
the information sent to motor areas from the 
cerebellum is conveyed by a rate code? — a 
modulation in the firing frequency of neur- 
ons in which the precise timing of the spikes 
is irrelevant because the rate is averaged over 

tens to hundreds of milliseconds. 
Clearly, modulation of firing frequency 
has a crucial role in cerebellar processing. 
But before discard- 


The findings ing meer: of 

nuclear neurons as a 
cae equally computationally ele- 
intriguing gant, but otherwise 
Jor our functionally irrel- 
understanding evant, mechanism, 
of how the let’s remember that 
brain controls codes based on the 
movement. 


timing or the rate of 
spikes are not mutu- 
ally exclusive. Previous work has demon- 
strated that spikes in Purkinje cells less than 
100 micrometres apart can be precisely syn- 
chronized*”, and that the level of synchrony 
is dynamically regulated’. So, at least in 
theory’, this would allow inputs from dif- 
ferent groups of Purkinje cells to become 
synchronized for brief periods of time, and, 
when a high degree of temporal precision 
is required (for instance, at the beginning 
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or end of a movement), to control the exact 
timing of spikes in nuclear neurons. To con- 
trol other aspects of movement, such as 
amplitude or speed, the cerebellum might 
switch Purkinje cells back to the asynchronous 
mode and modulate their firing frequency up 
or down, thus regulating the firing rate of 
nuclear neurons. 

As with every landmark paper, several 
questions remain. For example, in addition 
to inhibitory synapses from Purkinje cells, 
nuclear neurons receive excitatory inputs 
from mossy and climbing fibres of neurons 
in other parts of the brain. How does the 
cerebellum process such opposing signals to 
achieve its goals? Examining the patterns of 
excitatory and inhibitory inputs in vivo, and 
understanding how they are integrated in 
nuclear neurons, should provide an answer. 

Another question arises from work® show- 
ing that cerebellar nuclear neurons can be 
classified into different functional groups 
according to not only the proteins they express 
or their particular anatomical connectivities, 
but also their electrical properties. Do these 
cell groups also differ in the way they process 
the inputs they receive? 

Finally, Person and Raman’s findings reveal 
what nuclear neurons can do, but not what 
they actually do. Does synchronization of 
Purkinje cells result in the spiking activity 
of nuclear neurons during normal cerebellar 
processing? The answer may come from 
whole-cell recordings in awake animals 
to check whether the membrane-voltage 
signature of individual nuclear cells in vivo 
is consistent with a spike-generating mecha- 
nism based on membrane repolarization 
after synchronous inhibition. As for the ques- 
tion of whether time-locking is part of the 
cerebellar code used for motor control, it will 
be necessary to record from many nuclear cells 
simultaneously, and to examine the timing 
and synchronization of their spikes relative to 
movement when the animal is active. 


Javier F. Medina is in the Department 

of Psychology, University of Pennsylvania, 
Philadelphia, Pennsylvania 19104-6241, USA. 
e-mail: jmed@sas.upenn.edu 


Sync or sink? 
KAMRAN KHODAKHAH 


lucidating the principles by which neurons 

and neuronal circuits ‘compute’ remains a 
cardinal objective in neuroscience. Through 
an ingenious set of experiments, Person and 
Raman’ describe one such computational 
principle: how a (nuclear) neuron decodes 
the temporal structure of its synchronous 
(Purkinje cell) inhibitory inputs while ignoring 
the asynchronous ones. The question is how 
relevant this temporal decoding mechanism 
is to cerebellar processing. 

Person and Raman demonstrate in vitro that 
synchronous activity of even two Purkinje- 
cell inputs is sufficient to modestly entrain 
(time-lock) the activity of their target nuclear 
neuron. They posit that temporal decoding 
is the mechanism by which nuclear neurons 
encode information in vivo. But a confounding 
limitation of such temporal decoding is that 
the probability of firing entrained spikes is 
reduced so much that nuclear neurons cannot 
rate-code — that is, the persistent activity of 
the asynchronous inputs reduces the probabil- 
ity of a nuclear neuron firing entrained spikes 
to such an extent that it can no longer encode 
the firing rate of its synchronous inputs in its 
own firing rate’. This is problematic, because 
it has been known*”* for more than 40 years 
that the activity rates of individual nuclear and 
Purkinje neurons correlate with movement 
and thereby with each other. 

Moreover, to date, no in vivo data have 
refuted the assertion that the firing rate of a 
nuclear neuron is inversely related to the aver- 
age firing rate of its synaptically connected 
Purkinje cells’. Therefore, even if several Pur- 
kinje cells could be perfectly synchronized 
in vivo, it is doubtful that, as implied’, nuclear 
neurons process and encode movement- 
related information through temporal 
decoding alone. 

It is quite possible that some sort of tempo- 
ral decoding occurs intermittently in vivo, for 
example during brief conditions when very 
precise timing of muscle movement is needed. 
However, the study’ suggests that it often takes 
several spikes from synchronously active 
Purkinje cells (requiring tens to hundreds of 
milliseconds) to entrain a nuclear neuron. It is 
therefore questionable whether a rate-coding 
nuclear neuron can rapidly switch modes and 
transiently time-lock to a few synchronous 
Purkinje-cell spikes. 

Because of the severely diminished prob- 
ability of firing of entrained spikes, the authors 
often had to artificially depolarize the nuclear 
neurons to promote spiking. But what depolar- 
izing inputs could permit temporal decoding 
in vivo without diminishing time-locking? A 
neuromodulator molecule may partially pro- 
vide sustained depolarization free of synaptic 
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Figure 1 | Decoding Purkinje-cell inputs. a, Purkinje cells of the cerebellum send inhibitory inputs 
(blue bars) to their target nuclear neurons. Consequently, nuclear neurons get little chance to fire action 
potentials (red lines). b, Person and Raman' report that if two or more inhibitory inputs from Purkinje 
cells are synchronized (longer bars), it creates more opportunities for nuclear neurons to fire action 
potentials in the gaps between such ‘bundles’ of inhibitory inputs, entraining their spiking. (This graphic 


is not an exact representation of the authors’ data.) 


‘noise; although there is no evidence for this 
premise. The most likely depolarizing source is 
the excitatory inputs from mossy and climbing 
fibres. These inputs powerfully modulate the 
firing of Purkinje cells in vivo, and their collat- 
eral synaptic inputs to nuclear neurons should 
also depolarize the latter. But the strength of 
these inputs changes 
with movement, and 


The question is ; 
how relevant tisy Wall an@ eenet: 
this temporal ia aes seen 
deco ding n vivo, t wate inputs 
Psd are therefore likely to 
mechanism ts significantly diminish 
to cereb ellar the precision of the 
processing. modest time-locked 


spiking seen in vitro’. 

Linking in vitro and in vivo studies is always 
challenging, and in this case it is unclear 
whether nuclear neurons can temporally 
decode synchronous activity of a small num- 
ber of Purkinje cells in vivo. With strong elec- 
trical stimulation of the cerebellar cortex, 
Person and Raman’ could evoke time-locked 
responses. But estimating the number of acti- 
vated Purkinje cells in these experiments is 
challenging, and electrical stimulation does 
not have any specificity for activating cell 
types or axonal processes of neurons. A more 
rigorous test of temporal decoding using the 
technique of optogenetics will allow selective, 
synchronous activation of a few Purkinje cells 
in vivo, while eliminating complications that 
arise from the unavoidable electrical activation 
of mossy and climbing fibres. 

Lastly, even if a small number of synchro- 
nized Purkinje cells could modestly promote 
entrainment of nuclear neurons in vivo, 
complete synchronous activity of even two 


Purkinje cells is improbable. When examined 
pairwise in vivo during brief periods of high 
synchrony, the correlation between the firing 
of two Purkinje cells is of the order of a few 
per cent’®*”’ — tenfold lower than the 100% 
synchrony assumed here’. To what extent 
nuclear neurons can meaningfully entrain 
when a few Purkinje cells are only partially 
synchronized remains to be established. 

The new computational principle described 
by Person and Raman unveils a powerful tool 
in the brain’s information-processing tool kit. 
Future experiments will undoubtedly identify 
the circuits in which this tool is used, and may 
outline the conditions under which it contrib- 
utes to motor coordination by the cerebellum. m 
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ASTRONOMY 


A new class of planet 


Three examples of a new family of planets, which orbit a pair of stars rather than 
a single one, have been discovered. The Milky Way may contain millions of these 


circumbinary planets. SEE LETTER P.475 


JOHN SOUTHWORTH 


Ithough more than 700 extrasolar 

planets have been detected, none was 

known to orbit more than one star until 
the recent discovery’ ofa circumbinary planet, 
which orbits a pair of stars. This concept was 
previously confined to theory — and to sci- 
ence fiction, for example the planet Tatooine 
in Star Wars. On page 475 of this issue, Welsh 
et al.” describe the discovery of two more such 
planets and provide insight into their fre- 
quency of occurrence. The previously discov- 
ered planet’ and the new ones, each of which 
orbits its own system of two stars, were found 
using NASA’s Kepler space telescope”. 

For hundreds of years, scientists assumed 
that the Solar System is a typical example of a 
planetary system. That assumption was chal- 
lenged in 1995 by the discovery’ of 51 Pegasib, 
the first planet to orbit a normal star other than 
the Sun. Although this planet is probably a gas 
giant (the lower limit on its mass is 0.47 Jupiter 
masses), it orbits at only 0.052 astronomi- 
cal units (av) from its star (1 Au is the 
average distance between Earth and 
the Sun). This means that 51 Pegasib is 
100 times closer to its star than Jupiter 
is to the Sun. 

Planet 51 Pegasi b was discovered 
through precise measurements of the 
velocity ofits parent star, which revealed 
the motion induced in the star by the 
presence of the orbiting planet. This 
method has proved very successful for 
spotting planets, and can be credited 
with the discovery of roughly 400 so 
far. As observational programmes con- 
tinue, they become sensitive to planets 
on wider orbits (longer orbital periods). 
The dominant population of extraso- 
lar planets currently consists of objects 
that are more massive than Jupiter and 
are separated from their host stars by 
several astronomical units; many of 
these are in multi-planet systems. 

The other very successful method for 
discovering planets is to look for those 
that periodically transit (eclipse) their 
parent star. These transiting planets are 
a gold mine of information: they are the 
only ones whose size can be obtained, 
by measuring the amount of starlight 


“This article and the paper? under discussion 
were published online on 11 January 2012. 
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blocked during transit. This means that their 
surface gravities and mean densities can be 
calculated, ultimately allowing the investiga- 
tion of their internal structure and formation 
process. 

The transit method has led to the discovery 
of more than 200 planets, predominantly by 
teams that operate small wide-field robotic 
survey telescopes at observatories spread 
around the world, such as HATNet’ and 
SuperWASP”. These surveys are heavily biased 
towards large planets with small orbits. As a 
result, they are unparalleled sources of odd- 
balls such as WASP- 17 (ref. 6), the biggest and 
most rarefied planet known (up to twice the 
radius of Jupiter and only 6% as dense), and 
WASP-18 (ref. 7), which is ten times the mass 
of Jupiter and whirls around its host star every 
23 hours (Jupiter's orbital period is 11.9 years). 

The overriding aim of planetary research is 
to find one that might support life. Habitability 
most probably requires a rocky surface with 
liquid water, which, in turn, demands a planet 
no bigger than two Earth radii on an orbit with 


Figure 1 | Orbital configuration of the Kepler-34 system. The 
outer ellipse represents the orbital motion of the circumbinary 
planet Kepler-34 b, labelled b, around its host binary star system, 
which is composed of stars A and B in orbit around one another 
(as indicated by arrows). The plus sign shows the system's centre 
of mass. Spheres denote the orbital positions of the three bodies. 
One astronomical unit (AU) is the average distance between Earth 
and the Sun. Gravitational effects between the three bodies mean 
that this orbital configuration is gradually changing, so the bodies 
follow different paths on successive orbits. This is why the orbit 
of the planet shows a discontinuity in the upper part of the figure. 
Kepler-34 b is one of two circumbinary planets discovered by 
Welsh and colleagues”. (Modified from ref. 2.) 
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a period of hundreds of days. The transits of 
such a body will not only be infrequent but 
will cause the light from the star to drop by a 
puny 0.01%. Such a signal is much too mea- 
gre to pick up with ground-based telescopes, 
which suffer from the blurring effect of Earth’s 
atmosphere, as well as inevitable interruptions 
due to daylight and bad weather. 

Finding a habitable planet requires a 
larger — and much more expensive — tele- 
scope outside Earth’s atmosphere. Enter the 
Kepler spacecraft, the primary aim of which is 
to use the transit method to discover Earth-like 
planets. It monitors 150,000 stars in the con- 
stellations Cygnus and Lyra, and has already 
found more than 2,000 candidate transiting 
planets. Three of these have been confirmed to 
be circumbinary planets: Kepler-34 b (Fig. 1) 
and Kepler-35 b, which Welsh et al. describe in 
their study’, and Kepler-16 b (ref. 1). Not only 
does each of these three planets transit both of 
its parent stars, but the stars themselves eclipse 
each other. 

Although the discovery’ of Kepler-16 b 
revealed that it was possible for such an object 
to exist, Welsh and colleagues’ identification 
of two more circumbinary planets not only 
shows that such a planet is no freak object, 
but also allows an estimate of their prevalence 
to be made. The authors’ find that, for short- 
period binary star systems, the frequency 
of occurrence of circumbinary planets is at 
least 1%. Taking into account the fraction 
of stars that are short-period binaries, this 
result implies that there are millions of such 
planets distributed throughout the 
Galaxy. This analysis does not account 
for longer-period binary star systems, 
which are similarly plentiful in the 
Galaxy. 

Some circumbinary planets may even 
be habitable, although the three known 
ones are not. Kepler-16 b is slightly too 
cold, and Kepler-34 b and Kepler-35 b 
are too hot. They also have extreme 
seasons because the light received from 
their parent stars changes not only dur- 
ing the stars’ orbital periods (tens of 
days) and the planetary orbital period 
(hundreds of days), but also on much 
longer timescales through precession of 
the orbits due to three-body effects. 

What common characteristics do 
these three planets have? The central 
binary systems have orbital separations 
of between 0.18 and 0.22 au, and the 
planets orbit their hosts at distances of 
between 0.6 and 1.1 au. They are thus 
all close to the smallest possible stable 
orbits, but the fact that such planets 
were the first to be found is at least 
partly an effect of the detection method. 
As Kepler continues to observe, it will 
become sensitive to planets on longer 
periods: these three systems may 
represent only the tip of the iceberg. m 
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Hunter-gatherer 


cooperation 


A study of social networks in the hunter-gatherer Hadza people in Tanzania 
illuminates the evolutionary origins of humans’ unique style of cooperation 


in groups. SEE LETTER P.497 


JOSEPH HENRICH 


he social behaviour of humans poses a 
| significant evolutionary puzzle. Influ- 
enced by ‘prosocial’ motivations, we 
routinely help our relatives and friends in 
ways big and small, from donating kidneys to 
sharing food. Perhaps most puzzlingly, and 
unlike other primates, we also help strangers 
and cooperate in large groups by, for exam- 
ple, giving blood, going to war, recycling and 
paying taxes. Yet human prosocial behaviour 
varies dramatically between groups — from 
societies with little cooperation beyond 
extended kin to the vast scales of cooperation 
found in many modern states’. 

Among the key challenges to understand- 
ing the origins of human cooperation are the 
difficult questions of what the social lives of 
our Palaeolithic ancestors were like, and how 
they shaped our psychology. Studying mod- 
ern foraging populations who depend on tools 
and resources similar to those of our ances- 
tors is one of the few means we have of glean- 
ing certain kinds of insight into the past’. On 
page 497 of this issue, Apicella et al.’ give usa 
glimpse into the social dynamics of one of the 
few remaining populations of nomadic hunter- 
gatherers, the Hadza of Tanzania (Fig. 1). 

With its practical implications, human coop- 
eration stands as a central question that spans 
the behavioural sciences. From an evolutionary 
perspective, the conundrum is how individu- 
als sustain cooperation in the face of the ever- 
present forces of self-interest; in other words, 
how could natural selection favour actions 
that benefit others, or one’s group, but that 
also incur a personal cost? Theorists generally 
agree that the solution to this core dilemma 
requires assortative interaction, such that 
cooperators benefit other cooperators more 
than non-cooperators’. The challenge arises 
when one tries to delineate the processes that 


sustain such assortative interactions against 
invasion by ‘free-riders’ — non-cooperators 
who siphon off collective benefits. 

Through the 1970s and 1980s, many 
researchers assumed that hunter-gatherers 
tackle this core dilemma by relying on a com- 
bination of kinship and direct reciprocity. By 
targeting kin on the basis of shared genetic 
inheritance, cooperators are more likely to 
deliver benefits to fellow cooperators. Simi- 
larly, by reciprocating help with help, unrelated 
individuals can sustain tit-for-tat coopera- 
tion. However, by the twenty-first century 
it had become clear that although kinship 
and direct reciprocity can each explain some 
aspects of human prosociality, many domains 
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of cooperation, ranging from the sharing 
of meat within bands of hunter-gatherers to 
territorial defence, cannot be easily accounted 
for by these models’. 

To address this gap, researchers began to 
develop and apply new models. Gene-culture 
co-evolutionary models propose that, because 
social strategies are culturally learned, rapid 
cultural change tends to generate substantial 
variation in cooperation among groups while 
reducing variation within groups. In these 
models, cooperation is sustained by a process 
of cultural learning and the sanctioning of 
norm violators, which leads to the continuous 
reassortment of groups®. More cooperative 
groups tend to endure and expand, whereas 
less cooperative groups gradually break down. 
Another class of models, based on social selec- 
tion, proposes that individuals cooperate com- 
petitively, as a means of attracting an inflow of 
partners who bring benefits’. A third approach 
proposes that cooperation can be sustained as 
individuals seek out those with different skills, 
resources or abilities. Here, assortment is based 
on complementarity rather than similarity’. 

To illuminate how the Hadza tackle the 
core dilemma of cooperation, Apicella et al.* 
gathered data on assortment and coopera- 
tive tendencies. The authors studied assort- 
ment within two social networks. To assemble 
the first (a campmate network), they asked 
adult Hadza from 17 different bands who 
they wanted to camp with when their next 
band formed. For the second network (a gift 
network), individuals received three honey 
sticks — Hadza love honey — and were asked 
to secretly specify who should get each stick. 
Finally, to measure cooperativeness, the 
researchers gave individuals from each band 


Figure 1 | Helping hands. The Hadza people of Tanzania, such as these young men who are 

roasting birds they have caught, rely on hunting and gathering to obtain most of their food. By studying 
Hadza social networks, Apicella et al.* illuminate the population dynamics that underpin the evolution 
of human cooperation. 
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four additional honey sticks, and told them that 
any stick that was contributed to a common 
pool for their band would be tripled and the 
sticks distributed equally among band mem- 
bers. The Hadza could anonymously contrib- 
ute any number of these four sticks (from zero 
to four), thereby pitting self-interest against 
the common good. Cooperators contribute 
sticks to the pool whereas egotists con- 
tribute nothing and free-ride on others’ 
contributions. 

Let’s begin by considering those models that 
are not supported by the authors’ results. They 
find that Hadza do not preferentially pick more 
cooperative individuals as future campmates or 
stick-receivers. They also do not preferentially 
network with those possessing complemen- 
tary attributes, at least as indicated by age, food 
preferences or various physical measures. Thus, 
these findings do not favour existing models 
based on social selection or complementarity. 

On the positive side, the most striking find- 
ings emerge when the variation in cooperative 
behaviours is partitioned within and among 
the 17 Hadza bands. There is substantially 
more variation among the bands, and substan- 
tially less variation within them, than would 
be expected by chance. Despite the fluidity 
of band membership, it seems that some 
combination of similarity-based association, 
social learning and sanctioning establishes 
differences in cooperative tendencies among 
different bands. This pattern is particularly 
interesting in light of experiments’ showing 
that larger Hadza bands evince more fairness in 
anonymous interactions. Consistent with this, 
Apicella and co-workers’ data from both the 
campmate and gift networks suggest that high 
contributors associate with other high contrib- 
utors, and low contributors choose other low 
contributors. In fact, the gift-network results 
indicate that this extends to friends of friends: 
if your friend’s friend is highly cooperative, you 
are likely to cooperate more, too. 

As is the case in other primates"’, Apicella 
et al. also found that kinship and reciprocity 
contribute to assortment in Hadza social net- 
works. No surprises there. 

The crucial insight from this work’ is that 
understanding distinct aspects of cooperation 
among these hunter-gatherers must incorpo- 
rate an analysis of the dynamic processes at the 
population level that influence association, 
cultural transmission and band formation, 
instead of focusing tightly on purely individual 
actions within bands — the emphasis of much 
previous work. = 
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Shape matters 


The ligand- mediated binding of colloid particles to each other is more effective 
if the particles are flat rather than curved. This finding opens up opportunities 
for the design of self-assembling materials. 


SHARON C. GLOTZER 


stacking of cannonballs and the design of 

stealth aircraft, humans have long known 
that shape matters. On a much smaller scale, 
the shapes of molecules affect their ability to 
form crystals, and enzyme shape is central 
to the binding of their substrates. Writing in 
the Journal of the American Chemical Society, 
Mirkin and colleagues’ report a way in which 
shape can also affect the binding forces that 
hold nanometre-scale particles together — a 
discovery that suggests new approaches for 
constructing potentially useful architectures 
from these tiny building blocks. 


| pies the invention of the wheel to the 


Gold nanoparticle 


Metallic and semiconductor nanoparticles 
growas tiny crystals from solutions of precur- 
sor ions. Facets arise naturally from the aniso- 
tropic (directionally dependent) growth of the 
nanocrystals, producing particles that have a 
range of convex and concave shapes. These 
nanocrystals can be stabilized by coating them 
with small organic ligand molecules, or modi- 
fied by the attachment of larger molecules such 
as DNA. Ligand coatings can conspire with the 
atoms in nanoparticles to produce net inter- 
particle forces (either attractive or repulsive) 
through van der Waals, hydrophobic and 
electrostatic interactions. DNA modification 
can also confer specificity on inter-particle 
forces — particles to which single-stranded 


Gold surface 


Figure 1 | DNA-mediated binding of nanoparticles. Mirkin and colleagues’ prepared nanoscale gold 
particles of different shapes to which DNA molecules were attached. Although the main bodies of these 
molecules were duplexes (turquoise), the free ends of the DNA were single-stranded ‘sticky ends’ (pink). 
These could bind the particles together by forming duplexes with complementary sticky ends on other 
particles. The authors observed that particles with flat surfaces bound to each other more strongly than 
did spherical nanoparticles. This effect depends on how easily the sticky ends on different particles 

can approach each other. a, On spherical particles, relatively few sticky ends can come together to form 
duplexes, and the DNA molecules need to bend to allow duplex formation. b, When attached to flat 
surfaces, the DNA molecules can align so that more sticky ends form duplexes, without any bending. 
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DNA is attached bind only to other particles 
that have complementary DNA sequences 
appended, because of the resulting DNA duplex 
formation. 

Depending on their interactions, anisotropi- 
cally shaped nanoparticles can serve as build- 
ing blocks for self-assembled materials suitable 
for many applications, including solar cells, 
computer chips and even camouflage. There 
are various recent examples of nanoparticles 
whose anisotropic shape controls the struc- 
tures that assemble from them: octopods’, 
cuboctahedra (shapes formed by cutting the 
corners off a cube to create a polyhedron that 
has only squares and triangles for faces) and 
related polyhedra’, and rounded squares and 
cubes*®, to name just a few. 

Mirkin and co-workers’ now add to our 
knowledge of the shape dependence of nano- 
particle self-assembly. They made triangular 
gold nanoprisms — essentially, two-dimen- 
sional triangular objects — whose faces were 
modified with a layer of single-stranded DNA. 
When the authors combined these nano- 
triangles with others bearing complementary 
DNA, the particles ‘hybridized’ quickly and 
strongly with their counterparts. The research- 
ers observed that particle-particle binding was 
several million times stronger, and happened 
100 times faster, than in the case of similarly 
modified gold nanospheres. 

The reason the triangular-shaped particles 
bind so much better to each other than do the 
spheres is just what one might expect. Mirkin 
and colleagues found that the large, flat faces 
of triangles align to allow many more DNA- 
DNA binding interactions than are possible 
for the curved surfaces of the spheres (Fig. 1). 
Additionally, the authors observed that the 
binding affinity of each DNA molecule for 
its complementary strand seems to be higher 
for the flat particles than for the round ones, 
so that greater numbers of duplex linkers can 
form per unit area. Finally, they found that flat 
surfaces allow many complementary DNAs 
to form duplexes without first having to bend; 
such bending is necessary for the binding of 
multiple ligands of similar length attached to 
two curved surfaces. It is remarkable that sim- 
ple geometrical principles are at play even on 
such small length scales. 

Intuitively, plate-like particles of any shape 
should be just as effective as triangles in maxi- 
mizing the available interaction area for the 
ligands. In fact, any particles that have flat 
facets should bind together more easily using 
ligands than do spheres. And it is not just 
ligands such as DNA — which bind to each 
other through a precise combination of inter- 
actions — that should be able to maximize 
ligand-ligand interactions, and thus particle- 
particle interactions, when attached to flat 
surfaces. Ligands of any sort that attract each 
other should similarly benefit. 

Mirkin and co-workers demonstrated this 
generality using ligands that terminate in 
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carboxylic acid groups (COOH) in place of 
DNA. In this case, hydrogen bonds between the 
acid groups allow particle—particle association. 
The authors again observed that inter-particle 
attraction was considerably stronger for tri- 
angular gold particles than for spherical ones. 
Faceted particles could thus offer a new way 
of making ‘patchy’ particles’, which contain 
interaction sites in patches at prescribed loca- 
tions. By varying the patterns of the patches, 
the self-assembly of such particles can be 
controlled to make particular structures. 

Shape is also beginning to be used to 
control depletion interactions, which occur 
when entropy effects cause small particles 
or polymers to be excluded from the spaces 
between larger colloid particles. This exclusion 
increases the osmotic pressure outside — and 
so provides an effective attraction between — 
the larger particles. Micrometre-scale discs 
stack up* readily into columns when mixed 
with nanometre-scale micelles in water, dem- 
onstrating that the discs bind more strongly 
face to face through depletion, rather than 
edge to edge or edge to face. Similar principles 
have been used for the assembly of nanoscale 
octahedra’. The idea of matching the shapes 
of colloid particles to maximize the strength 
of depletion interactions between them 
has also been applied to particles that have 
curved surfaces, allowing the preparation of 
Pac-Man-shaped particles that specifically 
bind smaller spheres in their dimple-shaped 
‘mouths”. 

Shape and the alignment of facets are crucial 
factors in particle assembly, even when the 
only interactions between particles are those 
that prevent them from overlapping. For exam- 
ple, computer simulations have shown that 
hard, regular tetrahedrons”’ and triangular 
bipyramids" each self-assemble into quasi- 
crystals (aperiodic crystals) once the fraction 
of space occupied by the particles within a 
given volume exceeds roughly one-half. 

The thermodynamically stable assemblies 
of truncated tetrahedra — a series of polyhedra 
formed when the four tips of a tetrahedron are 
progressively truncated until an octahedron is 
produced — are also dramatically influenced 
by the precise shape of the objects”. The result- 
ing crystals can be quasicrystalline, or can have 
a similar structure to the atomic structures of 
diamond, of the metallic form of tin (B-tin) or 
of high-pressure lithium. They can also form 
what are known as body-centred cubic crys- 
tals. Taken together, these assemblies represent 
the most complex and diverse set of structures 
ever predicted to form from hard shapes. 

In contrast to the alignment of nanoprisms 
discovered by Mirkin and colleagues’, which 
is driven by the minimization of potential 
energy, the faces of hard polyhedra align to 
maximize entropy. This results in entropi- 
cally ‘sticky’ patches that effectively provide 
attraction between particles without the 
need for ligand binding. Such behaviour is 
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possible only when the particles are confined 
toa fixed volume or pressure, but it serves as 
another recently discovered example of how 
shape matters. 

In fields ranging from biology to engineer- 
ing, the importance of shape has long been 
appreciated, but not always fully understood. 
Through work such as that of Mirkin and co- 
workers, we are starting to get a sense of how 
shape affects objects on the nanoscale — and 
so a glimpse of the shapes of things to come. m 
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Even harder X-rays 


With the laser just over half a century old, another dream of the pioneers of this 
light source has been fulfilled. An atomic X-ray laser with unprecedentedly high 
photon energy has been demonstrated. SEE LETTER P.488 


JON MARANGOS 


riting in this issue, Rohringer et al.' 

report the first demonstration of 

atomic X-ray lasing at a photon 
energy of 849 electronvolts. The X-ray laser 
emission in this case is based on atomic pop- 
ulation inversion, which arises when more 
members of an ensemble of atoms exist in 
a higher-energy state than in lower-energy 
states. Rohringer and colleagues created 
population inversion in a sample of neon gas 
using a device known as an X-ray free-electron 
laser, operating at 960 eV. Although its pho- 
ton energy falls in the ‘soft’ X-ray part of the 
electromagnetic spectrum, the authors’ X-ray 
source paves the way to making practical 
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Figure 1 | Atomic X-ray lasing. Rohringer et al.’ have demonstrated X-ray 
lasing at a photon energy of 849 electronvolts by creating atomic population 
inversion in a sample of neon gas using the Linac Coherent Light Source 
(LCLS), which is an X-ray free-electron laser. a, The LCLS X-ray radiation has 
a large spread in photon energy and considerable fluctuation in the average 
photon energy (about 960 eV) obtained from sequential laser pulses (shown 
in different colours). Photon count is measured after transmission through 


1,000 


atomic lasers in the ‘hard’ X-ray regime (ener- 
gies beyond 5 keV). 

The X-ray free-electron laser (FEL) used 
by Rohringer and colleagues was the Linac 
Coherent Light Source (LCLS) at the SLAC 
National Accelerator Laboratory in Menlo 
Park, California. The LCLS, which is based 
on self-amplified spontaneous X-ray emis- 
sion from a high-quality, high-energy (up 
to 14 GeV) electron beam, was essential for 
creating the population inversion; that is, for 
‘pumping’ the new atomic X-ray laser. 

But doesn’t the need to use an X-ray FEL 
somewhat undermine the authors’ new laser 
demonstration? Yes and no. The LCLS has 
already generated’ laser-like X-rays of unprec- 
edented brightness — having a photon energy 
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that can be tuned from 500 eV to more than 
8 keV in pulses lasting 5-80 femtoseconds 
(1 femtosecond is 10°'’s) and containing up 
to 10’* photons. The LCLS has enabled the 
first laboratory studies” of matter exposed to 
intense keV-energy X-rays, and has been used 
to image single nanocrystals° and viruses’. 
Nevertheless, like any FEL based on self-ampli- 
fied spontaneous X-ray emission, the LCLS has 
inherent limitations in the quality of its output 
radiation. These limitations are principally due 
to the stochastic nature of the process, which 
leads to poor temporal coherence (meaning 
that the waves comprising the X-ray field are 
not well synchronized) and strong spectral and 
temporal jitter from sequential laser pulses. 

It has long been common practice to use 
one type of visible-light laser to provide the 
energy for pumping the population inver- 
sion needed to operate another laser that has 
different optical properties. For example, 
nanosecond visible lasers are regularly used 
to pump femtosecond visible lasers. In their 
study (page 488), Rohringer et al. extend this 
idea to the X-ray region of the spectrum. Ear- 
lier X-ray lasers operating at lower photon 
energies, from 20 to 300 eV (ref. 7), have been 
pumped by physical processes, such as electron 
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the neon sample and is strongly attenuated. b, The LCLS ‘pumps’ many of the 
neon atoms from the ground state (state 0) into a higher-energy state of ionized 
neon (state 2). Most of the ions in this excited state decay through an Auger 
process, but some will make the radiative transition to a state (state 1) that has 

a lower energy than state 2. This transition is accompanied by the emission of 
laser radiation that has a precise average photon energy (849 eV). c, The laser 
radiation has a smaller energy spread than that of the LCLS. 


collision, recombination or photoionization, 
in plasma media that are created by either 
electrical discharges or high-intensity visible 
lasers. These laser systems are limited because 
the energy density needed to pump population 
inversion scales as the cube of the laser photon 
energy. Therefore, a very high energy density 
is required to reach the harder X-ray range. 
Indeed, population-inversion hard X-ray 
lasers pumped by the extremely high energy 
density produced in thermonuclear explosions 
were the target of the Excalibur national- 
defence project at the Lawrence Livermore 
National Laboratory in the 1980s*. But these 
X-ray sources are of rather restricted general 
application. 

In their experiment, Rohringer and col- 
leagues focused LCLS X-ray pulses each 
containing more than 10’” 960-eV photons 
and lasting just 40 fs into a beam a few micro- 
metres across that impinged on a dense sam- 
ple of neon atoms. This produced the energy 
density required to pump many of the atoms 
from the ground state (state 0) into a higher- 
energy state of ionized neon (state 2) and attain 
population inversion. Most of the ions in the 
excited state decay by a mechanism known 
as an Auger process on a timescale of about 
2.7 fs. But some of them make the radiative 
transition to a state (state 1) that has a lower 
energy than state 2. Because the applied LCLS 
X-ray radiation leads to the sudden creation of 
a large population of neon ions in state 2 and 
an equally rapid depletion of state 1, there isa 
transient population inversion between 2 and 
1 that leads to lasing (Fig. 1). 

The most important property of the authors’ 
laser is that the emitted X-ray pulses have 
a precise central energy that has a spread of 
less than 1 eV and that is tied to the atomic 
properties of the neon ions. These properties 
are a result of the rules of quantum mechan- 
ics and so do not vary from pulse to pulse. By 
contrast, the energy spread of the LCLS X-ray 
pulses that impinge on and are transmitted 
through the neon gas is 8 eV on any given 
pulse, and nearer to 15 eV when averaged 
over many pulses (Fig. 1). The authors’ mod- 
elling of the process suggests that the energy 
spread is consistent with the physical limit set 
by the short duration (about 5 fs) of the pulses 
emitted from the neon sample. The smaller 
energy spread of the emitted X-rays means 
that the temporal coherence of the 849-eV 
X-rays is more than tenfold greater than that of 
the LCLS pulses. 

Although Rohringer and colleagues’ X-ray 
laser has a lower output power than that of 
LCLS radiation, the greatly improved coher- 
ence and reduced energy spread will open new 
areas of research that demand a well-defined 
X-ray energy. Examples of these areas include 
the study of physical processes such as pho- 
toionization and inelastic X-ray scattering, 
which can be used to study ultra-fast changes 
in matter. Moreover, the authors’ X-ray laser 


pulses and the LCLS pulses are closely syn- 
chronized and so can be used in experiments 
in which two X-ray fields of differing photon 
energy are required to interact simultaneously 
with a sample. 

Not only does the authors’ X-ray-FEL- 
pumped scheme operate at a much higher pho- 
ton energy than achieved by other approaches 
to producing X-ray lasers based on population 
inversion, but also the repetition rate (the rate 
at which X-ray pulses are produced) is as high 
as that of the FEL used. The LCLS has a repeti- 
tion rate of up to 120 hertz, which is more than 
100-fold higher than that of any previous X-ray 
laser. Although the new laser is more difficult 


STEM CELLS 
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to operate than the LCLS, its photon-energy 
range, stability and repetition rate make it of 
considerable potential utility in time-resolved 
structural studies of matter. = 
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The right neighbour 


Different cell types produce signals that regulate the activity of blood- forming 
stem cells. A study shows that certain rare mesenchymal cells surrounding blood 
vessels are the main source of one such signal in mice. SEE ARTICLE P.457 


ILYA A. SHESTOPALOV & LEONARD I. ZON 


oping with a lifetime of tissue function, 
( injury and disease requires replenish- 

ment of the cells that make up organs. 
Replacement cells originate from adult-tissue 
stem cells such as haematopoietic stem cells, 
which continually form all types of blood cell. 
The growth and activity of tissue stem cells is 
regulated by neighbouring cells that comprise 
the stem-cell niche — a microenvironment 
containing regulatory signalling molecules. 
Identifying, controlling and mimicking the 
niche signals represent challenges for the 
emerging field of regenerative medicine. On 
page 457 of this issue, Ding et al.' identify the 
cells that produce a signal called stem-cell 
factor, which is essential for the generation of 
new blood cells from haematopoietic stem cells 
in embryonic and adult mice. 

The bone-marrow niche contains small 
blood vessels called sinusoids (Fig. 1) made 
up of endothelial cells together with a variety 
of immune cell and the axonal processes of 
peripheral neurons’. The rest of the bone 
marrow is filled with mesenchymal cells of vari- 
ous functions. For example, some (the stromal 
cells) provide connective tissue, others (osteo- 
blasts) replenish or remodel the inner bone 
surface. Unlike the majority of stromal cells, 
arare subset that originates from the neural- 
crest tissue of the embryo expresses the protein 
nestin and influences the function of 
haematopoietic stem cells (HSCs)’. 

To understand the niche microenviron- 
ment, it is necessary to identify the cell types 
that provide the regulatory signals and to estab- 
lish which signals each produces. For example, 


Haematopoietic stem cell 


Endothelial 
cell 


LEPR-expressing 
mesenchymal cell 


Figure 1 | A home for blood-forming cells. 
Haematopoietic stem cells (HSCs) generate all 
types of blood cell. In mammals, they reside in 

the bone marrow, where blood vessels made 

of endothelial cells form microscopic cavities 
called sinusoids. Endothelial cells are surrounded 
by connective tissue composed of different 
mesenchymal cells, which can produce molecular 
signals that control HSC growth. Ding et al. 

find that in mouse bone marrow, the molecular 
signal stem-cell factor (SCF), which is required to 
sustain HSCs, is primarily produced by a type of 
mesenchymal cell that expresses the leptin receptor 
(LEPR). Endothelial cells also contribute some SCE. 


stem-cell factor (SCF) is secreted by endothe- 
lial cells, osteoblasts and nestin-expressing 
stromal cells**®. Although previous research’ 
has shown that depletion of many cell types in 
the bone-marrow niche produces measurable 
effects on HSCs, the causality of the observed 
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TECHNOLOGY 


A deeper peek into living organisms 


In two papers in this issue, Lechene and 
colleagues!” report the first use of an 
approach called multi-isotope imaging mass 
spectrometry (MIMS) in living organisms 
(see pages 516 and 520). This technique 
has outstanding resolution: it provides 

data in the sub-micrometre range, allowing 
analysis of structures as small as cellular 
regions. 

MIMS involves labelling living tissues with 
stable isotopes. The isolated sample surface 
is then bombarded with a beam of ions, and 
the ejected ‘secondary’ ions are measured 
with a mass spectrometer to determine 
the sample’s molecular composition. The 
technique can distinguish between ions 
of very similar mass, providing a precise 
measurement of isotope labels, which can be 
imaged simultaneously. 

Lechene and co-workers used MIMS 
to test the immortal-strand hypothesis, 
which proposes that asymmetrically 
dividing stem cells also segregate their 
DNA asymmetrically. That is, the daughter 
cells that will remain stem cells retain the 
older DNA template, whereas those that are 
committed to differentiation inherit newly 
synthesized DNA strands. The authors 


changes has been difficult to ascertain. 

To establish the relative importance of the 
various SCF-producing cells in HSC growth, 
Ding et al.' developed a mouse model in which 
the Scf gene could be replaced with a gene 
encoding green fluorescent protein, in all or in 
selected cell types, at different times during the 
animal's life. In this way, cells that would nor- 
mally express Scf became fluorescent instead. 
Using this mouse model, Ding et al. found 
that, when they abolished SCF production in 
all cell types, HSCs disappeared from the bone 
marrow. The authors then removed Scf from 
specific cell types in the HSC niche, and found 
that loss of Scf from blood cells, osteoblasts 
and nestin-expressing mesenchymal cells did 
not alter HSC abundance in the bone mar- 
row. Mice with Scf-lacking endothelial cells, 
however, had fewer HSCs during embryonic 
development, and only some HSCs remained 
in these mice during adulthood. 

The researchers went on to identify a type 
of mesenchymal cell that surrounds sinusoids 
in adult bone marrow and that, in contrast to 
other niche cells, expresses the gene Lepr, which 
encodes the leptin receptor. The LEPR protein 
regulates fat metabolism in some cell types, but 
its function in bone-marrow mesenchymal cells 
is unknown. Ding et al. observed that loss of 
SCF from Lepr-expressing mesenchymal cells 


disprove this proposal, showing that DNA 
strands segregate randomly in proliferating 
crypt cells of the mouse small intestine 
(pictured). This finding should further our 
understanding of tissue homeostasis. 

The researchers also analysed protein 
turnover in the mechanosensory hair cells 
in the inner ear of frogs and mice. During 
most vertebrates’ lifetime, hair cells are not 
replaced, but their degraded proteins are. 
One kind of structure within these cells is 
the stereocilia, each of which is made up of 
hundreds of filaments of the protein actin. 
Lechene and collaborators quantified actin 
turnover in both adult and neonatal hair 
cells and report that, with the exception of 
the filaments’ tips, this protein’s turnover is 
particularly slow throughout stereocilia. 
This observation differs from previous 
findings® that stereocilium actin has a rapid 
turnover time. According to Lechene and 
co-authors, this discrepancy may be due 
to differences in experimental conditions 
between the two studies. 

The team also demonstrates successful 
use of MIMS for human studies. They 
thus not only further prove the broad 
applicability of their technique, but also 


reduced HSC abundance in adult mice. More- 
over, when the authors deleted the Scf gene from 
both endothelial and Lepr-expressing cells, 
nearly all HSCs disappeared from adult mice. 
These results indicate that, in the bone marrow 
of adult mice, SCF comes primarily from Lepr- 
expressing mesenchymal cells that envelop 
sinusoids, with a smaller contribution of SCF 
expression from sinusoid endothelial cells. 

Future studies are needed to characterize the 
functions of the Lepr-expressing mesenchy- 
mal cells in the HSC niche. According to Ding 
and colleagues’ gene-expression data’, these 
cells produce another HSC-regulating signal, 
stromal-cell-derived factor 1 (SDF1), along 
with the enzyme alkaline phosphatase, which 
makes them similar to previously described 
mesenchymal cells that also surround blood 
vessels in the bone marrow’. But the Lepr- 
expressing cells do not produce nestin, which 
underscores the diversity among mesenchymal 
stromal cells. Because several genetic tools are 
available to control gene expression of spe- 
cific cells in mice, a next logical step should 
be determining whether Lepr-expressing 
mesenchymal cells are one of the main sources 
of other HSC-regulating signals. 

Different vertebrates have HSC niches 
in different tissues, and the location of the 
HSC niche can change during an organisms 
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open the door to its use for investigations 
of metabolism and cell-lineage tracking in 
humans. Francesca Cesari and Deepa Nath 
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3. Rzadzinska, A. K. et al. J. Cell Biol. 164, 887-897 
(2004). 


lifetime’. In mammals, HSCs are found first in 
the aorta, then in the liver and, finally, in the 
bone marrow. In fish, HSCs also start out in the 
aorta, but then go on to occupy tissue in the tail 
and, eventually, the kidneys. All of these seem- 
ingly unrelated vertebrate tissues have vascular 
endothelial cells surrounded by mesenchymal 
cells, which — as Ding et al.’ now highlight 
— are required for regulating HSCs in mice. It 
remains unclear, however, whether different 
tissue-specific cell types that make up the bone 
marrow, liver and kidneys provide similar 
signals to regulate HSC activity. 

At present, clinically useful HSCs cannot 
be efficiently cultured in the lab. The presence 
of mesenchymal cells in HSC cultures has 
occasionally been linked to enhanced HSC 
growth”®, but which mesenchymal cell types 
are responsible for this effect has been difficult 
to determine. Robust HSC growth could per- 
haps be achieved by culturing them on top of 
mesenchymal ‘feeder’ cells that express specific 
markers such as LEPR or nestin. According to 
Ding and colleagues’ results’, only one in 8,000 
bone-marrow cells expresses LEPR, yet these 
rare cells are the primary source of the SCF 
needed to support HSCs. Unfortunately, cul- 
turing LEPR- or nestin-expressing cells is not 
an easy task, so more robust conditions should 
be developed to culture such cells. 


C. P. LECHENE AND COLLEAGUES 


The approach used by Ding et al.’ to iden- 
tify the main sources of SCF in the haemato- 
poietic niche, although labour-intensive, sets 
a standard of rigour for researchers studying 
the bone marrow, and should also be applied 
to other signals that affect HSCs. Analysis of 
the gene-expression signatures of specific 
cells that express other major signalling mol- 
ecules involved in HSC regulation may allow 
the identification of marker genes in addition 
to Lepr. These marker genes could be used to 
isolate cells that support HSC growth from 
the mesenchymal milieu of the bone marrow. 
Understanding the complex HSC niche will 
one day make it possible to create a synthetic 
microenvironment capable of sustaining 
long-term HSC growth. = 
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Chemical beauty contest 


Most drug candidates fail clinical trials, in many cases because the compounds 
have less than optimal physico-chemical properties. Anew method for assessing 
the ‘drug-likeness’ of compounds might help to remedy the situation. 


PAUL LEESON 


a sense of chemical aesthetics — a feel 

for how drug-like any particular mol- 
ecule is. But is it possible to measure such 
chemical beauty? Reporting this week in 
Nature Chemistry, Hopkins and colleagues’ 
provide a quantitative estimate of drug-like- 
ness that assesses a combination of a molecule’s 
physical properties. Unlike the commonly used 
descriptions of drug-likeness, their approach 
allows a single, continuous scale to be defined, 
so that molecules can be ranked in order of 
desirability. 

Drugs are developed from the optimization 
of ‘lead’ molecules, which are frequently found 
through the biological screening of compound 
collections. Before being finally accepted into 
use by regulatory and paying bodies, an opti- 
mized drug candidate must undergo years of 
intensive toxicological and clinical-efficacy 
studies. Most orally active drugs that survive 
these arduous developmental pressures have 
a set of physico-chemical properties that fall 
within a certain range of values — they are 
said to lie in a defined physical and chemical 
‘drug-like space’. Until now, this drug-like 
space has been defined using cut-off values 
for permissible physical properties, perhaps 
most notably the values defined by the medi- 
cinal chemist Christopher Lipinski and his 
colleagues in the ‘rule of five” (Box 1). 


Hesseneer medicinal chemists develop 


Hopkins and co-workers’ point out that 
Lipinski’s rule can be misleading, because 
undesirable compounds could pass the drug- 
likeness test by only just meeting all four cut-off 
criteria, whereas better compounds could 
fail because they just miss one of the cut-offs. 
The application of the rule in this unintended 
way may help to explain why the compounds 
in current patents from pharmaceutical 
companies are, on average, significantly less 
drug-like than marketed drugs**. 

Taking a cue from a study’ that used math- 
ematical ‘desirability functions’ to assess how 
suitable a range of compounds would be 
as drugs that act in the central nervous sys- 
tem, Hopkins and co-workers’ used a similar 
approach to analyse the drug-likeness of a set 
of 771 oral drugs approved by the US Food and 
Drug Administration. The authors defined 
desirability functions for eight physical prop- 
erties proposed to be important for oral drugs, 
including the four Lipinksi properties. They 
also took into account the number of aromatic 
rings and rotatable bonds in a molecule, the 
polar surface area (a measure of how hydro- 
philic a molecule is) and the number of groups 
in the molecule known to cause toxicity. The 
functions captured the full distribution of each 
physical property and provided a continuous 
quantitative estimate of drug-likeness (QED) 
ona scale from most to least drug-like. 

Because the bulk physical properties of com- 
pounds are known to correlate with each other 
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50 Years Ago 


Itis the purpose of this article to 
show that a group of compounds 
related to lysergic acid diethylamide 
(LSD-25) produces surfacing 
behaviour of carp with the 
movement directed towards 

the surface ... It has been shown 
previously from work in this 
laboratory that very small quantities 
of derivatives of lysergic acid, like 
lysergic acid diethylamide 
(LSD-25) and lysergic acid 
ethylamide (LAE-32), havea 
surfacing effect on small Siamese 
fighting fish ... After the fish had 
been exposed to LSD-25 for 10 min., 
they showed signs of LSD-25 
activity. After 30 min. all three fish 
in the tank containing LSD-25 
were at the surface of the liquid ina 
nose up-tail down position ... For 
the next hour the fish in the tank 
containing LSD-25 remained at the 
surface, from time to time moving 
and even swimming backwards 

... The fish were returned to the 
running-water pool after 1.5 hr. 

In the pool they continued to stay 
at the surface, moving about but 
not going to the bottom at all. 2 hr. 
later they were still at the surface ... 
Experiments in larger tanks, and 
field trials, are planned. 

From Nature 27 January 1962 


100 Years Ago 


Ihave repeatedly observed the 
brilliancy of cats’ eyes in the 

dark in particularly favourable 
circumstances. I have a brilliant 
incandescent light in my hall, and 
several cats on the premises. The 
entrance drive is in a line with the 
door and the hall lamp. When I 
call a cat in the chances are that if 
there she simply sits and looks at 
me, presenting the spectacle of two 
small incandescent lights glowing 
out of the darkness. Light, observer, 
and cat are all three in line, as 
observed by Colonel Herschel. 
From Nature 25 January 1912 
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to some extent, Hopkins and colleagues used 
different property weightings to maximize the 
overall information content of the combined 
QED values for each drug. This weighting sys- 
tem will be controversial to some, because it 
may not reflect the importance of each property 
to drug-likeness. For example, there is a strong 
case to be made for lipophilicity as the domi- 
nant drug-like property, because it is impor- 
tant for a molecule’s absorption, metabolism, 
promiscuity (binding to unwanted targets), 
toxicity and survival in the drug-development 
pipeline**®’. Nevertheless, the benefits of the 
authors’ strategy are clear: their method not 
only computes drug-likeness ona single quan- 
titative scale, but, more importantly, it reveals 
that drugs that fail the Lipinski criteria have 
distributions of drug-likeness that overlap with 
drugs that pass the criteria (Fig. 1). 

Hopkins and colleagues went on to show 
that their QED approach is better at differen- 
tiating drugs from non-drugs than the Lipinksi 
rule and other schemes based on cut-off val- 
ues. They further validated the discriminatory 
power of their method by comparing QED 
scoring with the results of a study’ in which 79 
medicinal chemists decided which of 17,117 
molecules were attractive starting points for 
optimization as drugs, on the basis of only 
visual inspection of the molecular structures. 
Impressively, the QED scores for molecules 
considered to be attractive by the chemists 
were significantly higher than those for mol- 
ecules considered unattractive. This suggests 
that the QED method may, at least in part, 
capture a sense of the chemical aesthetics that 
medicinal chemists develop through knowl- 
edge, experience and intuition”. 

Finally, Hopkins and co-workers used 
their method to predict the drug-likeness of 
167,045 ligand compounds that bind to 1,729 
proteins (on the basis of binding information 
from ChEMBL, a database of biologically 
active compounds”). This allowed them to 
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BOX1 


The Lipinski rule of five 


The medicinal chemist Christopher 

Lipinski and his colleagues analysed? the 
physico-chemical properties of more than 
2,000 drugs and candidate drugs in clinical 
trials, and concluded that a compound is 
more likely to be membrane permeable and 
easily absorbed by the body if it matches the 
following criteria: 


@ Its molecular weight is less than 500. 

@ The compound's lipophilicity, expressed as 
a quantity known as logP (the logarithm of 
the partition coefficient between water and 
1-octanol), is less than 5. 

@ The number of groups in the molecule 
that can donate hydrogen atoms to 
hydrogen bonds (usually the sum of 
hydroxyl and amine groups in a drug 


determine which proteins had the most drug- 
like set of ligands. Proteins whose ligands had 
the highest QED scores should be the most 
chemically tractable targets for drug discov- 
ery, because their known ligands are the most 
drug-like. 

QED is not the final word in understand- 
ing the underlying features of chemical beauty 
and drug-likeness, but it does provide a holis- 
tic, more balanced assessment than previous 
approaches. It is also customizable, highly flex- 
ible and seems to be straightforward to imple- 
ment. Any combination of physical properties 
can be chosen to define desirability functions, 
and users can set the relative weightings for 
properties as desired. QED should therefore 
find immediate use in replacing cut-off rules 
for the selection of oral-drug-like compounds 
for screening. The method can also be applied 


© Lipinski fail 


® Lipinski pass 
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Figure 1 | Assessments of ‘drug-likeness. Physico-chemical criteria, such as those defined by the Lipinski 
rule of five, are typically used to predict whether a compound is drug-like or not. a, The bar chart shows the 
number of oral drugs that fail or pass the Lipinski rule, based on a set of 771 drugs approved by the 

US Food and Drug Administration. b, Hopkins and colleagues’ report a method for predicting the drug- 
likeness of compounds on a scale of 0 (not drug-like) to 1 (drug-like). The chart shows the distribution of 
drug-likeness calculated using this method for the same drugs depicted in a. The analysis shows an overlap 
of Lipinski passes and failures for a range of drug-likenesses. Notably, some very drug-like molecules fail 
the Lipinski rule, whereas some very un-drug-like compounds pass it. 
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molecule) is less than 5. 

@ The number of groups that can accept 
hydrogen atoms to form hydrogen bonds 
(estimated by the sum of oxygen and 
nitrogen atoms) is less than 10. 


The rules, based on the 90-percentile values 
of the drugs’ property distributions, apply 
only to absorption by passive diffusion 

of compounds through cell membranes; 
compounds that are actively transported 
through cell membranes by transporter 
proteins are exceptions to the rule. Due in 
no small part to their simplicity, the Lipinski 
criteria are widely used by medicinal 
chemists to predict not only the absorption 
of compounds, as Lipinski originally 
intended, but also overall drug-likeness. 


to control sets other than oral drugs, such as 
lead-like molecules’, compounds that belong 
to specific target and therapeutic classes, and 
drugs that are administered non-orally. 

The widest long-term impact of Hopkins 
and colleagues’ method’ should be on the 
optimization of lead compounds, where it will 
help medicinal chemists to prioritize which 
drug-like compounds to prepare. It is to be 
hoped that the implementation of improved 
guidelines for drug-likeness, such as QED, at 
this stage of drug discovery will improve the 
quality’ of candidate drug molecules, and 
eventually help to reduce the 96% attrition 
rate of compounds that enter clinical trials’. m 


Paul Leeson is at the GlaxoSmithKline 
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ARTICLE 


doi:10.1038/nature10783 


Endothelial and perivascular cells 
maintain haematopoietic stem cells 


Lei Ding’, Thomas L. Saunders”, Grigori Enikolopov’ & Sean J. Morrison! 


Several cell types have been proposed to create niches for haematopoietic stem cells (HSCs). However, the expression 
patterns of HSC maintenance factors have not been systematically studied and no such factor has been conditionally 
deleted from any candidate niche cell. Thus, the cellular sources of these factors are undetermined. Stem cell factor (SCF; 


also known as KITL) is a key niche component that maintains HSCs. Here, using Scf*” 


knock-in mice, we found that Scf 


was primarily expressed by perivascular cells throughout the bone marrow. HSC frequency and function were not affected 
when Scf was conditionally deleted from haematopoietic cells, osteoblasts, nestin-cre- or nestin-creER-expressing cells. 
However, HSCs were depleted from bone marrow when Scf was deleted from endothelial cells or leptin receptor 
(Lepr)-expressing perivascular stromal cells. Most HSCs were lost when Scf was deleted from both endothelial and 
Lepr-expressing perivascular cells. Thus, HSCs reside in a perivascular niche in which multiple cell types express 


factors that promote HSC maintenance. 


Stem cells are maintained in specialized microenvironments in tis- 
sues—termed niches—in which supporting cells secrete factors that 
promote stem cell maintenance’. In most mammalian tissues, includ- 
ing the haematopoietic system, the identities of the cells that promote 
stem cell maintenance remain uncertain’”. One popular model of the 
HSC niche proposed that osteoblasts express many factors that pro- 
mote HSC maintenance’, including SCF, CXCL12, angiopoietin 1 and 
thrombopoietin* ’”. However, none of these factors have been con- 
ditionally deleted from osteoblasts, so there is no direct evidence that 
osteoblasts are a functionally important source of these factors. 

We found that most HSCs localize adjacent to sinusoidal blood vessels 
throughout the bone marrow’*”. This led us to hypothesize that the HSC 
niche is perivascular~’. Others found that perivascular stromal cells 
secrete high levels of CXCL12 and other factors proposed to regulate 
HSC maintenance’*". Nestin-expressing mesenchymal stem cells also 
localize adjacent to blood vessels in the bone marrow and express factors 
thought to promote HSC maintenance’. Ablation of the Cxcl12- 
expressing cells or the nestin-expressing cells reduced HSC frequency’*”’. 
Administration of antibodies against endothelial cells in vivo impairs 
HSC engraftment and transformed endothelial cells promote HSC 
expansion in culture’*'®. Nonetheless, no factor that promotes HSC 
maintenance has been conditionally deleted from any perivascular cell, 
so there is no direct evidence that endothelial or perivascular cells are 
functionally important sources of factors for HSC maintenance in vivo. 

SCF is non-cell autonomously required for HSC maintenance in 
vivo’. Differential splicing and proteolytic cleavage lead to the 
expression of a membrane-bound form and a soluble form of SCF. 
HSCs are depleted in S//SI“ mutant mice”’, which express soluble SCF 
but lack the membrane-bound form, indicating that membrane- 
bound SCF is particularly important for HSC maintenance’’. Mice 
with a mixture of wild-type and SI/SI stromal cells only exhibit nor- 
mal haematopoiesis in the immediate vicinity of the wild-type cells, 
demonstrating that SCF acts locally in creating the niche”. 

Scf has been suggested to be expressed by endothelial cells, bone 
marrow fibroblasts, osteoblasts, Cxcll2-expressing perivascular 


stromal cells and nestin-expressing mesenchymal stem cells*!?!*??~*, 


but Scf has not been conditionally deleted to test which source(s) are 
functionally important for HSC maintenance. We generated Scf” and 
Scf" mice to systematically examine Scf expression and to conditionally 
delete Scf from subpopulations of bone marrow cells. 


Scf is expressed by perivascular cells 

We generated Sc” knock-in mice by inserting enhanced green fluor- 
escent protein (gfp) into the endogenous Scf locus (Supplementary Fig. 
la-c). Sof”? mice died perinatally (Fig. 1a and Supplementary Fig. 1f, 
g) with severe anaemia (Fig. 1b and Supplementary Fig. 2c), as observed 
in mice with a strong loss of SCF/c-Kit function’’. By quantitative 
reverse transcription—polymerase chain reaction (qRT-PCR), Sef tran- 
scripts were nearly undetectable in Sc“? newborn liver (Fig. 1c). 

The overall cellularity of the newborn liver was reduced about 
twofold in Scf’* and about fivefold in Sc’? mutant mice com- 
pared to Scf*’* controls (Fig. 1d). The frequency of HSCs 
(CD150* CD48 CD41 Scal* cKit* cells®*°) in the newborn liver 
was reduced about eightfold in Scf”*? mutant mice compared to 
littermate Scf?/* or Scf*/* controls (Fig. le). Consistent with this, 
newborn Scf®?’®? liver cells gave significantly lower levels of donor cell 
reconstitution in irradiated mice compared to Scf*””* or Scf*’* con- 
trols (Fig. 1f and Supplementary Fig. 2d). Sc”? mice therefore have 
a severe loss of SCF function. 

Using flow cytometry we determined that only rare 
(0.027 + 0.0099%, mean + standard deviation (s.d.)) enzymatically 
dissociated bone marrow cells were positive for GFP. The actual fre- 
quency of GFP* cells in the bone marrow may be somewhat higher as 
our dissociation conditions may not have recovered all of the GFP* 
stromal cells. These GFP* cells were negative for CD45 and Ter119, 
indicating a non-haematopoietic source of SCF (Fig. 1g). Endogenous 
Scf transcripts were highly enriched in GFP* stromal cells and highly 
depleted in GFP stromal cells (Supplementary Fig. 2f, g), suggesting 
that GFP expression faithfully reflected endogenous Scf expression. 

GFP was mainly expressed by cells surrounding sinusoids through- 
out the Sc?’* bone marrow, with some expression by cells surround- 
ing venuoles and arterioles (Fig. lh-m and Supplementary Fig. 2h, i). 


lHoward Hughes Medical Institute, Children’s Research Institute, Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA. Transgenic Animal Model Core, 
University of Michigan, Ann Arbor, Michigan 48109-2216, USA. °Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA. 
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Figure 1 | Sc is a strong loss-of-function allele and Scf is primarily 
expressed by perivascular cells in the bone marrow. a, b, Scf”®? homozygous 
mice died perinatally and were severely anaemic (1 = 4-20). RBC, red blood cell. 
c, Sef transcripts in livers from newborn mice by qRT-PCR (n = 3). d, e, Newborn 
liver cellularity and HSC frequency (n = 4). f, Irradiated mice (CD45.1*) were 
transplanted with 3 X 10° newborn liver cells from Scf?, Scf?/* or Scf*/* 
donor (CD45.2*) mice along with 3 X 10° recipient (CD45.1” ) bone marrow 
cells (3-4 experiments with 13-18 mice per genotype). g, Scf-GFP was expressed 
by rare non-haematopoietic stromal cells (n = 8). h-j, GFP was primarily 
expressed by perivascular cells in the bone marrow of Scf”’~ mice. Endothelial 
cells were stained with an anti-endoglin antibody. k-n, GFP was not detected in 
bone-lining osteoblast lineage cells (osteopontin) in the diaphysis (k-m) or in 
trabecular bone (n). o-q, Higher magnification images of a sinusoid. r-u, A 
CD150* CD48" lineage candidate HSC (arrow) localized adjacent to a GFP- 
expressing perivascular cell. Nuclei were stained with 4',6-diamidino-2- 
phenylindole (DAPI; in blue). All data represent mean + s.d. Two-tailed 
Student’s t-tests were used to assess statistical significance. **P < 0.01, 

** P< (),001. Scale bars in j, m and n are 50 tum. Scale bars in q and ware 20 um. 


GFP partially overlapped with endothelial marker staining (Fig. 1h-j, 
o-q and Supplementary Fig. 2i), suggesting that both endothelial and 
perivascular stromal cells express Scf. In contrast, GFP was not con- 
centrated near the endosteum (Supplementary Fig. 2h) and we did not 
detect any GFP expression by bone-lining cells that expressed osteo- 
blast markers in either the diaphysis (Fig. 1k—-m) or trabecular bone 
(Fig. 1n). Perivascular stromal and endothelial cells therefore 
appeared to represent the major sources of SCF in bone marrow. 
We isolated Scf-GFP* cells by flow cytometry and performed gene 
expression profiling. Several mesenchymal stem/stromal cell markers, 
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including Cxcl12, alkaline phosphatase, Vcam1, Pdgfra and Pdgfrb 
were highly elevated in Scf-GFP™ cells relative to whole bone marrow 
cells (Supplementary Table 1). This indicates that Scf-GFP* cells 
included mesenchymal stem/stromal cells*” and Cxcl12-expressing 
perivascular stromal cells'®. Nestin was not detected in Scf-GFP* 
perivascular cells (Supplementary Table 1). 

As we observed previously*®, CD150° CD48° lineage candidate 
HSCs were mainly found adjacent to sinusoidal blood vessels 
throughout the bone marrow. Sixty-five per cent (47/73) of all 
CD150* CD48" lineage” cells were immediately adjacent to GFP- 
expressing stromal cells (Fig. 1r-u). Almost all of the remaining cells 
(30%; 22/73) were within five cell diameters of GFP-expressing cells. 
This suggests that Scf-GFP-expressing cells form a perivascular niche 
for HSCs. 


Scf is required by adult HSCs 


We generated a floxed allele of Scf (Scf") to conditionally delete Scf 
from candidate niche cells (Supplementary Fig. 3a—c). Mice homo- 
zygous for the germline recombined allele of Scf—Scf ’~ —were peri- 
natally lethal and anaemic (Fig. 2a), similar to other Scf-deficient mice 
(Fig. 1a)!’. Recombination of the Scf" allele therefore led to a strong 
loss of SCF function. We were unable to amplify Scf transcripts by 
PCR from the liver of Scf ‘~ newborns (Fig. 2b). 

We generated Ubc-creER; Scf“" mice to ubiquitously delete Scf 
upon tamoxifen administration. We administered tamoxifen- 
containing chow to Ubc-creER; Sc “ft mice and littermate controls for 
1-2 months beginning at 8 weeks of age, and then killed them for 
analysis. Some of the mice became anaemic and ill during tamoxifen 
administration. The Ubc-creER; Scf“" mice had significantly lower red 
blood cell counts than controls (Fig. 2c) and a trend towards lower 
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Figure 2 | Scfis required for adult HSC maintenance. a, Homozygous Scf “~ 
mutant mice generated from germline recombination of the Scf" allele were 
perinatally lethal and anaemic. b, Scf transcripts amplified by RT-PCR from the 
livers of newborn mice. c, Global deletion of Scf in Ubc-creER; Sof" mice led to 
anaemia (n = 5-6). d, e, Global deletion of Scfin adult mice significantly reduced 
cellularity and HSC frequency in bone marrow (two femurs and two tibias) and 
spleen (n = 8-10). BM, bone marrow. f, To perform a limit dilution analysis”, 
three doses of donor bone marrow cells were competitively transplanted into 
irradiated mice. ELDA software (http://bioinf.wehi.edu.au/software/elda/) was 
used to calculate HSC frequency and assess statistical significance (two 
experiments). g, 3 X 10° donor bone marrow cells were transplanted with 

3 X 10° recipient bone marrow cells into irradiated recipient mice (three 
experiments with a total of 12-14 recipients per genotype). h, HSCs did not 
express Scf-GEP by flow cytometry. A, recombined Sef allele; +, wild-type allele 
of Scf. All data represent mean + s.d. *P < 0.05, **P < 0.01, ***P < 0.001. 
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white blood cell and platelet counts (Supplementary Fig. 3d). Ubc- 
creER; Scf" mice exhibited approximately twofold reductions in the 
overall cellularity of bone marrow and spleen compared to controls 
(Fig. 2d). 

CD150* CD48" Lin” Scal* c-Kit* HSCs were also depleted in the 
bone marrow and spleen of Ubc-creER; Scf“" mice compared to con- 
trols treated concurrently with tamoxifen (Fig. 2e). Limit dilution 
analysis demonstrated that long-term multilineage reconstituting 
cells were 3.5-fold less frequent in the bone marrow of Ubc-creER; 
Sef" “ mice compared to controls upon transplantation into irradiated 
mice (Fig. 2f). Bone marrow cells from Ubc-creER; Scf“" mice gave 
significantly lower levels of donor cell reconstitution in irradiated 
mice (Fig. 2g). These data confirmed that SCF is required for HSC 
maintenance in adult mice. 

CD150° CD48° Lin” Scal* cKit* HSCs from Sc?’ mice did not 
express GFP by flow cytometry (Fig. 2h). This is consistent with prior 
studies'”*'”* in suggesting that Scf non-cell autonomously regulates 
HSC maintenance. To test the role of other haematopoietic cells we 
conditionally deleted Scf using Vavl-cre. As expected”, Vav1-cre 
recombined a conditional loxP-EYFP reporter” in virtually all 
HSCs, CD45* and Terl19~ haematopoietic cells (Fig. 3a and 
Supplementary Fig. 4a). Eight-week-old Vav1-cre; Scf” mice exhibited 
normal blood cell counts, bone marrow composition (Supplemen- 
tary Fig. 4b, c), and bone marrow and spleen cellularity (Fig. 3b). 
Scf‘’~ heterozygous mice exhibited a twofold decline in the frequency 
of CD150* CD48" Lin” Scal* c-Kit* HSCs relative to wild-type 
littermates. However, deletion of the second allele of Scf from haema- 
topoietic cells in Vavl-cre; Scf” mice did not further reduce HSC 
frequency in the bone marrow or spleen (Fig. 3c). Bone marrow cells 
from adult Vav1-cre; Scf” mice had a normal capacity to reconstitute 
irradiated mice (Fig. 3d and Supplementary Fig. 4d) and to form 
colonies in methylcellulose (Supplementary Fig. 4e, f). Therefore, Scf 
expression by haematopoietic cells is not required for HSC mainten- 
ance in adult bone marrow. 


HSCs do not require SCF from osteoblasts 


Col2.3-Cre recombines genes in fetal and postnatal osteoblasts”. 
Consistent with this, we found strong enhanced yellow fluorescent 
protein (EYFP) expression among bone-lining cells in Col2.3-cre; 
loxP-EYFP mice (Fig. 3e). To test whether osteoblasts produce SCF 
for HSC maintenance, we analysed 8-week-old Col2.3-cre; Sef” mice. 
Col2.3-cre; Sef” mice had normal blood counts (Supplementary Fig. 
5a), normal lineage composition in the bone marrow and spleen 
(Supplementary Fig. 5b) and normal bone marrow and spleen cellularity 
(Fig. 3g). Although Scf*’~ germline heterozygous mice exhibited a two- 
fold decline in the frequency of CD150° CD48" Lin” Scal* c-Kit* 
HSCs relative to wild-type littermates, conditional deletion of the second 
allele of Scf from osteoblasts in Col2.3-cre; Sof” mice did not further 
reduce HSC frequency in the bone marrow or spleen (Fig. 3h). Bone 
marrow cells from Col2.3-cre; Scf” mice had a normal capacity to 
reconstitute irradiated mice (Fig. 3i and Supplementary Fig. 5c) and 
to form colonies in methylcellulose (Supplementary Fig. 5d, e). 
Therefore, Scf expression by osteoblasts is not required for HSC main- 
tenance in adult bone marrow. 


HSCs do not require SCF from nestin* cells 


In nestin-cre; loxP-EYFP mice we found rare EYFP-expressing peri- 
vascular stromal cells around larger blood vessels, not sinusoids, in 
the bone marrow (Fig. 3f). These cells exhibited a very different 
distribution than Scf-expressing cells (compare Fig. 3f to Fig. lh-m 
and Supplementary Fig. 2h, i). Eight-week-old nestin-cre; Sef” mice 
had normal blood cell counts (Supplementary Fig. 6b), normal lineage 
composition and cellularity in the bone marrow and spleen 
(Supplementary Fig. 6c and Fig. 3j). Comparing nestin-cre; Scf" 
mutants with Scf‘’~ controls, deletion of Scf from nestin-cre-expressing 
cells did not reduce HSC frequency in the bone marrow (Fig. 3k). 
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Figure 3 | SCF from haematopoietic cells, osteoblasts and nestin-Cre- 
expressing stromal cells is not required for HSC maintenance. a, Vav1-Cre 
recombined the loxP-EYFP reporter in virtually all HSCs, CD45* and Terl19* 
haematopoietic cells. b, c, Deletion of Scf from haematopoietic cells did not 
significantly affect bone marrow or spleen cellularity or HSC frequency (n = 4). 
d, A competitive reconstitution assay with Vavi-cre; Scf””, Scf*’~ and Scf*/* 
bone marrow cells (2 experiments with a total of 10 recipients per genotype). 
e, Col2.3-Cre recombined the loxP-EYFP reporter in bone-lining osteoblast 
lineage cells. f, Nestin-Cre recombined the loxP-EYFP reporter in rare stromal 
cells around larger blood vessels. g, h, Bone marrow and spleen cellularity 

(g) and HSC frequency (h) in Col2.3-cre; Sef" ’~ mice relative to controls 

(n = 5-6). i, A competitive reconstitution assay with Col2.3-cre; Sef”, Scf*/— 
and Scf*’* bone marrow cells (3-5 experiments with a total of 14-22 recipients 
per genotype). j, k, Bone marrow and spleen cellularity (j) and HSC frequency 
(k) in nestin-cre; Scf" ’~ mice relative to controls (n = 5-7). 1, 3 X 10° donor 
bone marrow cells from nestin-cre; Sef” and Scf*/~* mice gave similar levels of 
donor cell reconstitution in irradiated mice. Reconstitution levels from Scf*’~ 
cells were modestly but significantly lower (3-5 experiments with a total of 14- 
24 recipient mice per genotype). A, recombined Sef" allele; +, wild-type allele; 
—, germline deleted allele. Scale bar is 100 tm in e and 50 um in f. All data 
represent mean + s.d. NS, not significant. *P < 0.05, **P < 0.01, 

***D <(0,001. 
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Nestin-cre; Scf”~ mice did exhibit a significant decline in HSC fre- 
quency in the spleen (Fig. 3k), raising the possibility that nestin- 
cre-expressing cells are a component of the HSC niche in the spleen. 
Bone marrow cells from adult nestin-cre; Sef" ’~ mice had a normal 
capacity to reconstitute irradiated mice (Fig. 3l and Supplementary Fig. 
6d). Conditional deletion of Scf by administering tamoxifen for 2-5 
months to adult nestin-creER; Scf" ‘ mice also did not affect haemato- 
poiesis, HSC frequency, or reconstituting capacity in irradiated mice 
(Supplementary Fig. 7). Therefore, Scf expression by nestin-cre- 
expressing or nestin-creER-expressing perivascular cells is not required 
for the maintenance of HSCs in adult bone marrow. 

Because nestin-GFP-expressing bone marrow cells express Scf'”, 
we independently characterized nestin-GFP expression. Consistent 
with the prior report’’, we observed strong nestin-GFP staining along 
larger vessels in the bone marrow (Supplementary Fig. 8; see supple- 
mentary figure 1 from ref. 12). Nestin-GFP was also observed in 
perisinusoidal stromal cells in a pattern that resembled Scf-GFP* 
expression (Supplementary Fig. 8a). This appeared to be different 
from the nestin-cre expression pattern, which we detected only 
around larger blood vessels in the bone marrow (Fig. 3f). In nestin- 
Cherry and nestin-GFP double transgenic mice we detected nestin- 
Cherry expression around larger vessels but not around sinusoids, 
whereas nestin-GFP was detected around both (Supplementary Fig. 8). 
Thus, different nestin transgenes seem to be expressed by different 
subpopulations of perivascular stromal cells. Nestin-GFP appears to 
exhibit more expression in perisinusoidal stromal cells than other 
nestin transgenes. Our data are therefore consistent with the possibility 
that nestin-GFP-expressing stromal cells contribute to the HSC niche 
as previously suggested’’, even though conditional deletion of Scf with 
nestin-cre and nestin-creER did not affect HSC frequency. 


HSCs require SCF from endothelial cells 
We conditionally deleted Scf from endothelial cells using Tie2-cre*’. 
Tie2-Cre recombined in endothelial (Fig. 4a) and haematopoietic cells 
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Figure 4 | Deletion of Scf from endothelial cells depletes HSCs. a, b, Tie2- 
Cre recombined the loxP-EYFP conditional reporter in VE-cadherin* 
endothelial cells and in haematopoietic cells in the bone marrow. c, Bone 
marrow and spleen cellularity in Tie2-cre; Scf”~ mice and littermate controls 
(n = 4-7). d, HSC frequency in Tie2-cre; Sef mice and controls (n = 4-7). 
e, HSC frequency in the liver of newborn Tie2-cre; Sof” ~ mice and controls 
(n = 3-6). PO, postnatal day 0. f, HSC frequency in 1-month-old Tie2-cre; 
Sef" /~ mice and controls (n = 3-4). g, Bone marrow cells from Tie2-cre; Sef" a 
mice gave significantly lower levels of reconstitution relative to cells from 
Sof’’” and Scf*’" mice (3-5 experiments with a total of 15-25 recipients per 
genotype). All data represent mean + s.d.*P < 0.05, **P < 0.01, ***P < 0.001. 
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(Fig. 4b) but not in mesenchymal stem/stromal cells from the bone 
marrow (Supplementary Fig. 9d, e). Because haematopoietic cells do 
not express Scf (Fig. 1g and Fig. 2i) and conditional deletion of Scf 
from haematopoietic cells did not affect HSC frequency (Fig. 3a-d), 
the use of Tie2-cre allowed us to test whether SCF expression by 
endothelial cells regulates HSC frequency. 

Eight-week-old Tie2-cre; Scf”~ mice exhibit normal blood cell counts 
(data not shown), bone marrow and spleen cellularity (Fig. 4c). 
However, the frequency of CD150* CD48° Lin” Scal* c-Kit* HSCs 
in the bone marrow was significantly reduced in Tie2-cre; Scf”~ mice 
relative to controls (Fig. 4d). Consistent with this, 300,000 bone marrow 
cells from Tie2-cre; Sef” mice gave significantly lower levels of donor 
reconstitution upon transplantation into irradiated mice (Fig. 4g). In 5 
independent experiments, 24 of 25 recipients of Scf*’* cells, 15 of 15 
recipients of Scf*’~ cells, and only 7 of 21 recipients of Tie2-cre; Scf”— 
cells were long-term multilineage reconstituted. By Poisson statistics this 
corresponds to an HSC frequency in control bone marrow of at least 
1/93,200 cells but only 1/739,900 in Tie2-cre; Scf” mice. Endothelial 
cells are therefore an important source of SCF for HSC maintenance. 

The HSC depletion in Tie2-cre; Scf” mice probably reflects an 
ongoing need for SCF expression by endothelial cells in adult bone 
marrow, because when HSCs are depleted as a consequence of 
reduced SCF/c-Kit signalling, HSC frequencies return to normal 
levels upon restoration of normal SCF/c-Kit  signalling’??!. 
Nonetheless, we also examined whether SCF expression by endothelial 
cells during development is required by HSCs. We founda 1.7-2.1-fold 
reduction in HSC frequency in the liver of newborn Tie2-cre; Scf”— 
mice (Fig. 4e) and a twofold reduction in HSC frequency in the bone 
marrow of 1-month-old Tie2-cre; Scf” mice (Fig. 4f) relative to Scf/~ 
and Scf*’* controls. The magnitude of HSC depletion in adult bone 
marrow seemed to increase, as we found a 2.7-fold and 5.2-fold reduc- 
tion in HSC frequency in the bone marrow of 8-week-old Tie2-cre; 
Scf”~ mice relative to Scf*’” and Scf*’* controls, respectively 
(Fig. 4d). These data suggest that ongoing SCF expression by endothe- 
lial cells in adult bone marrow contributes to HSC maintenance; 
however, HSC depletion in adult bone marrow may reflect a loss of 
SCF expression by endothelial cells during development. 


HSCs require SCF from perivascular cells 


We found that Lepr is highly restricted in its expression within the 
bone marrow to Scf-GFP-expressing perivascular stromal cells 
(Supplementary Table 1). Consistent with this, Lepr-cre; loxP-EYFP 
mice exhibited EYFP expression in perivascular stromal cells (Fig. 5b, 
e) but not in haematopoietic cells (Fig. 5b, c, e), bone-lining cells 
(Fig. 5c), or endothelial cells (Fig. 5d). 

Consistent with the gene expression profile of Scf-GFP* cells 
(Supplementary Table 1), EYFP* cells from Lepr-cre; loxP-EYFP mice 
did not detectably express nestin but did express mesenchymal stem/ 
stromal cell markers including Cxcl12, alkaline phosphatase, 
PDGFR« and PDGFRB (Supplementary Fig. 9a-c). These data indi- 
cate a mesenchymal identity for the Lepr-expressing stromal cells; 
however, the lack of EYFP expression in bone-lining cells from 
Lepr-cre; loxP-EYFP mice suggests that the Lepr-cre-expressing peri- 
vascular cells did not give rise to osteoblasts during normal develop- 
ment. Future studies will be required to assess the relationship 
between Lepr-cre-expressing perivascular cells, mesenchymal stem 
cells, and other perivascular stromal cells. 

Bone marrow cellularity was significantly reduced in Lepr-cre; 
Scf“? mice compared to Scf*’* controls, but not compared to 
Scf*/*? controls (Fig. 5g). Spleen size ve 5f) and cellularity were 
significantly increased in Lepr-cre; Scf®? mice (Fig. 5g). Sections 
through the spleen revealed increased extramedullary haematopoiesis 
in Lepr-cre; Scf”®? mice (data not shown). The frequency of 
CD150* CD48" Lin” Scal* c-Kit" HSCs was significantly reduced in 
the bone marrow of Lepr-cre; Scf”” mice, but significantly increased in 
the spleen (Fig. 5h). The total number of bone marrow and spleen HSCs 
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Figure 5 | Deletion of Scffrom Lepr-cre-expressing perivascular stromal 
cells depletes HSCs in the bone marrow. a-c, Lepr-Cre recombined the loxP- 
EYFP reporter in perisinusoidal stromal cells in the bone marrow but not in 
bone-lining or haematopoietic cells. d, Lepr-Cre did not recombine in VE- 
cadherin* endothelial cells. e, 0.013 + 0.009% (mean = s.d.; n = 3) of bone 
marrow cells from Lepr-cre; loxP-EYFP mice were EYEP". f, g, Spleen size 

(f) and bone marrow and spleen cellularity (g) (mn = 4-7). h, HSC frequency 
(n = 4-7). i, Total HSC numbers (including bone marrow and spleen) in Lepr- 
cre; Scf”” mice (n = 4-7). j, Limit dilution analysis” of the frequency of long- 
term multilineage reconstituting cells in the bone marrow of Lepr-cre; Scf"” 
mice relative to controls (two experiments). k, HSC frequency in the newborn 
liver (n = 4-11). 1, HSC frequency in 1-month-old Lepr-cre; Scf”®? mice and 
controls (n = 3-6). m, Tie2-cre; Lepr-cre; Scf”— mice had significantly reduced 
bone marrow cellularity and increased spleen cellularity compared to Scf‘’~ or 
Tie2-cre; Sef" ”~ controls (n = 4-11). n, Deletion of Scf from endothelial and 
perivascular stromal cells in Tie2-cre; Lepr-cre; Sc” mice greatly depleted HSCs 
from adult bone marrow (n = 4-11). 0, Total HSC number was significantly 
reduced in Tie2-cre; Lepr-cre; Sof” mice compared to Tie2-cre; Sef” or Lepr- 
cre; Sof” mice (n = 4-11). gor gfp, Sof” allele; f, Scf" allele. Scale bars are 50 jum. 
All data represent mean + s.d. *P < 0.05, **P < 0.01, ***P < 0.001. 


per mouse was significantly reduced in Lepr-cre; Sof” mice (Fig. 5i). 
Lepr-cre-expressing cells thus promote HSC maintenance in the bone 
marrow, but not in the spleen, by producing SCF. 

In limit dilution transplantation studies the frequency of long-term 
multilineage reconstituting cells in Scf'’* and Scf*’” control cells 
was 1/38,311 and 1/38,352, respectively (Fig. 5j). In Lepr-cre; Scf’? 
bone marrow cells the frequency of long-term multilineage recon- 
stituting cells was 1/78,136, significantly lower than in Scf*/* and 
Scf*’? controls (Fig. 5j). Thus conditional deletion of Scf from 
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Lepr-cre-expressing perivascular stromal cells depletes HSCs from 
adult bone marrow. The frequency of GFP* cells in the bone marrow 
of Lepr-cre; Scf’” mice did not significantly differ from Scf?’* con- 
trols (Supplementary Fig. 10), suggesting that Scf deletion did not lead 
to the death of Lepr-expressing cells. 

The HSC depletion observed in Lepr-cre; Scf’” mice did not reflect 
a developmental effect of SCF expression by Lepr-cre-expressing cells 
as no HSC depletion was detected in the liver of newborn Lepr-cre; 
Scf’? mice (Fig. 5k). Furthermore, the magnitude of the HSC deple- 
tion increased with time in the adult bone marrow (Fig. 5h, 1). 

To test whether deletion of Scf from endothelial and Lepr-expressing 
perivascular cells has additive effects on HSC depletion we analysed 
8-week-old Tie2-cre; Lepr-cre; Scf” mice. Bone marrow cellularity was 
significantly reduced in Tie2-cre; Lepr-cre; Scf” mice compared to 
Tie2-cre; Scf’ and Lepr-cre; Scf” mice (Fig. 5m). Spleen cellularity 
was significantly increased in Tie2-cre; Lepr-cre; Scf” mice compared 
to Soft ’~ or Tie2-cre; Scf”” mice (Fig. 5m). HSC frequency was sig- 
nificantly reduced in the bone marrow of Tie2-cre; Lepr-cre; Scf” mice 
compared to Tie2-cre; Sof” or Lepr-cre; Scf” mice (Fig. 5n). The 
frequency and absolute number of HSCs in the bone marrow of 
Tie2-cre; Lepr-cre; Scf” mice was less than 5% of wild-type levels 
(Fig. 5m-o). This suggests that endothelial and Lepr-expressing peri- 
vascular stromal cells are the major sources of SCF for HSC mainten- 
ance in normal adult bone marrow and that deletion of Scf from each 
cell population has additive effects on HSC depletion. 

qRT-PCR revealed that endothelial and Lepr-cre-expressing peri- 
vascular cells expressed both long and short splice isoforms of Scf, 
rendering both cell types capable of expressing membrane-bound and 
soluble SCF (Supplementary Fig. 11). The levels of both isoforms of 
Scf in the two cell populations were significantly higher than in whole 
bone marrow cells, although Lepr-cre-expressing cells expressed 
much higher levels of both isoforms compared to endothelial cells 
(Supplementary Fig. 11). 


Discussion 


Our data demonstrate that HSCs reside in a perivascular niche in 
which endothelial and Lepr-expressing perivascular stromal cells 
are two functionally important components of the niche (Supplemen- 
tary Fig. 12). The simplest interpretation of our data is that both cell 
types produce SCF for the maintenance of HSCs in adult bone 
marrow; however, endothelial cells also produce SCF for HSC 
maintenance/expansion during development so it is possible that 
the depletion of bone marrow HSCs in adult Tie2-cre; Scf”~ mice 
reflects a developmental effect of endothelial SCF. Endothelial cells 
and perivascular stromal cells are probably not the only components 
of the HSC niche, as other cell types probably contribute through 
mechanisms other than SCF production (for example, refs 32, 33). 
Lepr-cre-expressing stromal cells did not express endogenous nestin 
(Supplementary Fig. 9c). Nestin-cre- or nestin-creER-mediated dele- 
tion of Scf did not deplete HSCs (Fig. 3j-1 and Supplementary Figs 6 
and 7). However, Lepr-cre-expressing perisinusoidal cells do partially 
overlap with nestin-GFP-expressing perivascular cells (Supplemen- 
tary Fig. 8 and Supplementary Fig. 9a-c). The Lepr-cre-expressing 
stromal cells therefore include stromal cells that express certain nestin 
transgenes, consistent with previous work’’, and may also include 
Cxcl12-abundant reticular cells’®. Perivascular stromal cells are 
probably heterogeneous and may include multiple cell types that con- 
tribute to HSC maintenance through different mechanisms. 
Although we have partially characterized the bone marrow niche 
for HSCs in adult mice under homeostatic conditions, other studies 
will be required to functionally characterize HSC niches in other 
haematopoietic tissues and after haematopoietic stress. 


METHODS SUMMARY 


Targeting vectors for making Scf” and Scf" mice were constructed by recom- 
bineering’. The Frt-flanked Neo cassette was removed by mating with Flpe 
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mice®®, Scf*” and Scf" mice were backcrossed onto a C57BL/Ka background before 
analysis. Mice used in this study included Ubc-creER**, CMV-cre’’, Vav1-cre’*, 
nestin-cre**, Tie2-cre*', Lepr-cre’ and loxP-EYFP” (all from the Jackson 
Laboratory), Col2.3-cre*°, nestin-creER” and nestin-GFP™’. All were maintained 
on a C57BL/Ka background. Unless otherwise indicated, data always reflect 
mean ~ s.d. and two-tailed Student’s t-tests were used to assess statistical signifi- 
cance (*P < 0.05, **P< 0.01, ***P< 0.001). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Mice. Targeting vectors for making Scf? and Scf" mice were constructed by 
recombineering™. Linearized targeting vector was electroporated into W4 
(Sc? ) or Bruce4 (Scf") embryonic stem (ES) cells. Positive clones were identified 
by Southern blotting and injected into B6 or C57BL/6-Tyr“”) blastocysts. 
Chimaeric mice were bred with B6 or C57BL/6-Tyr*”) mice to obtain germline 
transmission. The Frt-flanked Neo cassette was subsequently removed by mating 
with Flpe mice**. Sc” mice were backcrossed at least five times and Scf" mice 
were backcrossed at least three times onto a C57BL/Ka background before ana- 
lysis. Note that Scf" mice were backcrossed fewer times before analysis because 
Bruce4 ES cells were from a substantially C57BL/6 background. Ubc-creER”, 
CMV-cre*’, Vav1-cre**, nestin-cre**’, Tie2-cre*', Lepr-cre” and loxP-EYFP”’ mice 
were obtained from the Jackson Laboratory and maintained by crossing with 
C57BL/Ka mice. Col2.3-cre mice on a C57BL/6 background were provided by 
B. Cream and F. Liu’. Nestin-creER mice were obtained from G. Fishell and 
backcrossed onto a C57BL/Ka background”. Nestin-GFP mice were as previously 
described*’. C57BL/6-SJL (CD45.1) mice were used as recipients in transplanta- 
tion experiments. All mice were housed in the Unit for Laboratory Animal 
Medicine at the University of Michigan. All protocols were approved by the 
University of Michigan Committee on the Use and Care of Animals. Unless 
otherwise indicated, data are mean + s.d. and two-tailed Student’s t-tests were 
used to assess statistical significance (*P < 0.05, **P < 0.01, ***P < 0.001). 
Genotyping PCR. Primers for genotyping Scf were: OLD291, 5’-CCCGC 
AGCTCTGGTATATTTGC-3'; OLD292, 5’-CGGACACGCTGAACTTGTGG- 
3’; and OLD360, 5’-AAGCACTTCAGATTCTAGGG-3’. Primers for genotyp- 
ing Scf" were: OLD301, 5’-GGAAAAGAACCAAGTGAAGTC-3’; and OLD302, 
5'-GTCCGCAGCAAGCTCACCAGC-3'. Primers for genotyping Scf were: 
OLD301, 5’-GGAAAAGAACCAAGTGAAGTC-3’; and OLD304, 5'-ACGG 
GGAAAGACCTCCGGTCC-3’. Primers for genotyping Tie2-cre were: 
OLD438, 5'-CCTGTGCTCAGACAGAAATG-3’; and OLD435, 5’-GGCAAA 
TTTTGGTGTACGGTC-3’. Primers for genotyping Lepr-cre were: OLD434, 
5'-CATTGTATGGGATCTGATCTGG-3’; and OLD435, 5’-GGCAAATTTTG 
GTGTACGGTC-3’. 

Tamoxifen administration. Tamoxifen citrate (Sigma or Spectrum Chemical) 
was administrated in chow at 400 mg kg‘ with 5% sucrose (Harlan). Mice were 
fed tamoxifen chow for 1-5 months before being analysed. 

Long-term competitive reconstitution assay and limit dilution assay. Adult 
recipient mice were lethally irradiated by a Cesium 137 GammaCell40 Irradiator 
(MDS Nordia) at 300 rad per minute with two doses of 540 rad (total 1,080 rad) 
delivered at least 2h apart. Cells were transplanted by retro-orbital venus sinus 
injection of anaesthetized mice. 3 X 10° fetal liver or bone marrow cells were 
transplanted along with 3 X 10° recipient bone marrow cells into each irradiated 
mouse unless otherwise indicated. Mice were maintained on antibiotic water 
(neomycin sulphate 1.11 g]~! and polymixin B 0.121 g1~') for 14 days then 
switched to regular water. Recipient mice were periodically bled to assess the 
level of donor-derived blood lineages, including myeloid, B and T cells for at least 
16 weeks. Blood was subjected to ammonium chloride potassium red cell lysis 
before antibody staining. Antibodies including anti-CD45.2 (104), anti-CD45.1 
(A20), anti-Grl (8C5), anti-Mac-1 (M1/70), anti-B220 (6B2) and anti-CD3 
(KT31.1) were added to stain cells. Stained samples were analysed by flow cyto- 
metry. For limit dilution assay on Ubc-creER; Scf" “" mice, three doses of donor 
bone marrow cells (2 X 10*, 5 X 10* and 1.5 X 10°) along with 3 10° recipient 
bone marrow cells were transplanted into irradiated mice. For limit dilution assay 
on Lepr-cre; Sef! mice, 2 X 10*, 5 X 10* or 1 X 10° bone marrow cells along 
with 3 X 10° recipient bone marrow cells were competitively transplanted into 
irradiated mice. After 16 weeks, the percentage of mice that were not multilineage 
reconstituted by donor haematopoietic cells was plotted against cell dose. ELDA 
software was used to calculate HSC frequency and statistical significance (http:// 
bioinf.wehi.edu.au/software/elda/). 

Flow cytometry. Bone marrow cells were isolated by flushing the long bones or by 
crushing the long bones with mortal and pestle in Ca** and Mg’* free HBSS with 
2% heat-inactivated bovine serum. Spleen cells were obtained by crushing 
the spleen between two glass slides. The cells were drawn by passing through a 
25G needle several times and filtered with a 70 um nylon mesh. The following 
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antibodies were used to stain HSCs: anti-CD150 (TC15-12F12.2), anti-CD48 
(HM48-1), anti-CD41 (MWReg30), anti-Sca-1 (E13-161.7), anti-cKit (2B8), lin- 
eage markers (anti-Ter119, anti-B220 (6B2), anti-Grl (8C5), anti-CD2 (RM2-5), 
anti-CD3 (17A2), anti-CD5 (53-7.3) and anti-CD8 (53-6.7). DAPI was used to 
exclude dead cells. For flow cytometric analysis of Scf-GFP* or Lepr-cre; loxP- 
EYEP* stromal cells, bone marrow was flushed using HBSS with 2% bovine 
serum. Then whole bone marrow was digested with Collagenase IV 
(200 U ml ') and DNase I (200 U ml) at 37 °C for 15 min. Samples were then 
stained with antibodies and analysed by flow cytometry. Anti-CD140a (APAS5), 
anti-CD140b (APB5), anti-CD45 (30F-11) and anti-Ter119 antibodies were used 
to stain perivascular stromal cells. For analysis of bone marrow endothelial cells, 
mice were intravenously injected with 10 ug per mouse Alexa Fluor 647 conju- 
gated anti- VE-cadherin antibody (BV 13, eBiosciences)'*. Ten minutes later, the 
long bones were removed and bone marrow was flushed, digested and stained as 
above. Samples were run on FACSAria II or FACSCanto II flow cytometers. Data 
were analysed by FACSDiva (BD) or FlowJo (Tree Star) software. 
Methylcellulose culture. Cells were sorted into methylcellulose culture medium 
(3434, Stemcell Technology) and incubated at 37 °C as described’. 

Bone sectioning and immunostaining. Freshly dissected long bones were fixed 
in a formalin-based fixative at 4 °C for 3h. Then the bones were embedded in 8% 
gelatin in PBS. Samples were snap frozen with liquid nitrogen and stored at 
—80°C. Bones were sectioned using CryoJane (Instrumedics). Sections were 
dried overnight at room temperature (~22 °C) and stored at —80 °C. Sections 
were re-hydrated in PBS for 5 min before immunostaining. 5% goat serum in PBS 
was used for blocking. Primary antibodies were applied to the slides for 1h at 
room temperature followed by secondary antibody incubation for 30 min at room 
temperature with repetitive washes in between. Slides were mounted with anti- 
fade prolong gold (Invitrogen) and images were acquired on an Olympus IX81 
microscope or a Leica SP5 confocal microscope. Primary antibodies were: 
chicken-anti-GFP (Aves), rat-anti-endoglin (eBioscience), goat-anti-osteopontin 
(R&D), rat-anti-mouse pan-endothelial cell antigen (Meca32, Biolegend) and 
rabbit-anti-Cherry (Clontech). 

qRT-PCR. Cells were sorted directly into Trizol. Total RNA was extracted 
according to manufacturer’s instructions. Total RNA was subjected to reverse 
transcription or amplification using the WT-Ovation Pico RNA Amplification 
System (Nugen). Quantitative real-time PCR was run using SYBR green on a 
LightCycler 480 (Roche). B-Actin was used to normalize the RNA content of 
samples. Primers used in this study were: Scf: OLD405, 5’-TTGTTACCT 
TCGCACAGTGG-3’; and OLD406, 5’-AATTCAGTGCAGGGTTCACA-3’; 
Cxcl12: OLD35, 5'-TGCATCAGTGACGGTAAACCA-3’; and OLD36, 5’- 
GTTGTTCTTCAGCCGTGCAA-3’; Alpl: OLD164, 5'-TTGTGCCAGAGAAA 
GAGAGAGA-3’; and OLD165, 5’- GITTCAGGGCATTTTTCAAGGT-3’; 
Tie2: OLD158, 5'-ATGTGGAAGTCGAGAGGCGAT-3’; and OLD159, 5’- 
CGAATAGCCATCCACTATTGTCC-3'; Lepr: OLD418, 5'-TGATGTGTCA 
GAAATTCTATGTGG-3’; and OLD419, 5’- TGCCAGGTTAAGTGCA 
GCTAT-3’; Scf long form: 5'- GCCAGAAACTAGATCCTTTACTCCTGA-3’; 
and 5'-ACATAAATGGTTTTGTGACACTGACTCTG-3’; Scf short form: 5’- 
CCCGAGAAAGGGAAAGCCG-3’; and 5’-ATTCTCTCTCTTTCTGTTGCAA 
CATACTT-3’;  B-actin: OLD27, GCTCTTTTCCAGCCTTCCTT-3’; and 
OLD28, 5’-CTTCTGCATCCTGTCAGCAA-3’. 

Gene expression profiling. Three independent, fresh isolated aliquots of 
approximately 5,000 Scf-GFP* cells or whole bone marrow cells were flow cyto- 
metrically sorted into Trizol. Total RNA was extracted and amplified using the 
WT-Ovation Pico RNA Amplification system (Nugen) following manufacturer's 
instructions. Sense strand cDNA was generated using the WT-Ovation Exon 
Module (Nugen). Then, cDNA was fragmented and labelled using FL-Ovation 
DNA Biotin Module V2 (Nugen). The labelled cDNA was hybridized to 
Affymetrix Mouse Gene ST 1.0 chips following the manufacturer’s instructions. 
Expression values for all probes were normalized and determined using the 
robust multi-array average (RMA) method”. 


43. Irizarry, R. A. et al. Exploration, normalization, and summaries of high density 
oligonucleotide array probe level data. Biostatistics 4, 249-264 (2003). 
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A PGCl-a-dependent myokine that 
drives brown-fat-like development of 
white fat and thermogenesis 


Pontus Bostrém!, Jun Wu, Mark P. Jedrychowski?, Anisha Kordel, Li Ye!, James C. Lo‘, Kyle A. Rasbach!, 
Elisabeth Almer Bostrém®, Jang Hyun Choi', Jonathan Z. Long’, Shingo Kajimura‘*, Maria Cristina Zingaretti’, Birgitte F. Vind®, 
Hua Tu’, Saverio Cinti?, Kurt Hajlund®, Steven P. Gygi* & Bruce M. Spiegelman! 


Exercise benefits a variety of organ systems in mammals, and some of the best-recognized effects of exercise on muscle 
are mediated by the transcriptional co-activator PPAR-y co-activator-1 0 (PGCl-a). Here we show in mouse that 
PGCl-a expression in muscle stimulates an increase in expression of FNDC5, a membrane protein that is cleaved and 
secreted as a newly identified hormone, irisin. Irisin acts on white adipose cells in culture and in vivo to stimulate UCP1 
expression and a broad program of brown-fat-like development. Irisin is induced with exercise in mice and humans, and 
mildly increased irisin levels in the blood cause an increase in energy expenditure in mice with no changes in movement 
or food intake. This results in improvements in obesity and glucose homeostasis. Irisin could be therapeutic for human 
metabolic disease and other disorders that are improved with exercise. 


PGC1-z is a transcriptional co-activator that mediates many biological 
programs related to energy metabolism. Originally described as a co- 
activator of PPAR-y that modulated expression of uncoupling protein 1 
(UCP1) and thermogenesis in brown fat’, it has also been shown to 
control mitochondrial biogenesis and oxidative metabolism in many 
cell types. PGC1-o is induced in muscle by exercise and stimulates 
many of the best-known beneficial effects of exercise in muscle: 
mitochondrial biogenesis, angiogenesis and fibre-type switching”. It 
also provides resistance to muscular dystrophy and denervation-linked 
muscular atrophy’. The health benefits of elevated muscle expression of 
PGC1-% may go beyond the muscle tissue itself. Transgenic mice with 
mildly elevated muscle PGC1-« are resistant to age-related obesity and 
diabetes and have a prolonged lifespan*. This suggests that PGC1-o 
stimulates the secretion of factors from skeletal muscle that affect the 
function of other tissues. Here we show that PGC1-« stimulates the 
expression of several muscle gene products that are potentially secreted, 
including FNDC5. The Fndc5 gene encodes a type I membrane protein 
that is processed proteolytically to form a newly identified hormone 
secreted into the blood, termed irisin. Irisin is induced in exercise and 
activates profound changes in the subcutaneous adipose tissue, stimu- 
lating browning and UCP1 expression. Importantly, this causes a sig- 
nificant increase in total body energy expenditure and resistance to 
obesity-linked insulin resistance. Thus, irisin action recapitulates some 
of the most important benefits of exercise and muscle activity. 


Muscle PGCl-a transgenics 

Mice with transgenically increased PGC1-« in muscle are resistant to 
age-related obesity and diabetes’, suggesting that these animals have a 
fundamental alteration in systemic energy balance. We therefore ana- 
lysed the adipose tissue of the PGC1-« transgenic mice for expression of 
genes related to a thermogenic gene program and genes characteristic 


of brown fat development. There were no significant alterations in the 
expression of brown-fat-selective genes in the interscapular brown 
adipose tissue or in the visceral (epididymal) white adipose tissue 
(Fig. la). However, the subcutaneous fat layer (inguinal), a white 
adipose tissue that is particularly prone to ‘browning’ (that is, forma- 
tion of multilocular, UCP1-positive adipocytes), had significantly 
increased levels of Ucp1 and Cidea messenger RNAs (Fig. 1b). We also 
observed increased UCP1 protein levels and more UCP1-positive 
stained multilocular cells in transgenic mice compared to controls 
(Fig. 1c, d). There are recent reports that exercise causes a mild increase 
in the expression ofa thermogenic gene program in the visceral adipose 
tissue, a depot that has minimal expression of these genes’. As it is the 
subcutaneous white adipose depot that has the greatest tendency to 
turn on a thermogenic gene program and alter the systemic energy 
balance of mice’, we re-investigated this with regard to browning of the 
white adipose tissues in two types of exercise. Similar to what has been 
reported’, a twofold increase in Ucp1 mRNA expression was observed 
in the visceral, epididymal fat with 3 weeks of wheel running (Sup- 
plementary Fig. 1). However, a much larger change (approximately 25 
fold) was seen in the same mice in the subcutaeneous inguinal fat 
depot. Similarly, a small increase in Ucp1 mRNA expression was seen 
in the epididymal fat with repeated bouts of swimming in warm (32 °C) 
water (Supplementary Fig. 1); however a very large increase (65 fold) 
was observed in the inguinal white depot (Supplementary Fig. 1). 
Thus, muscle-specific expression of PGCla drives browning of sub- 
cutaneous white adipose tissue, possibly recapitulating part of the 
exercise program. 


Media from PGCl-a-expressing myocytes 


The effect on browning of the adipose tissues from PGC1-a-expressing 
muscle could be due to direct muscle-fat signalling or to another, 
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Figure 1 | Muscle-specific PGC1-a transgenic mice have increased brown/ 
beige fat cells in the subcutaneous depot. a, b, Quantitative polymerase chain 
reaction (qPCR) against brown fat and thermogenic genes in epididymal fat, 
BAT (a) and subcutaneous, inguinal (b) fat depots in muscle creatine kinase 
promoter (MCK)-PGC1-« transgenic mice or littermate controls. n = 7 for 
each group, repeated in a separate cohort with similar results. c, Representative 
immunohistochemistry against UCP1 in the inguinal depot from indicated 
mice. d, Western blot against UCP1 in the inguinal fat depot (n = 3 and 
repeated in an independent cohort with similar results). e, qPCR against 
indicated genes in adipocytes differentiated for 6 days from stromo vascular 
fraction (SVF) cells. This was done in the presence of conditioned media from 
primary myocytes with forced expression of GFP or PGC1-o (representative for 
three independent experiments). Data are presented as mean + s.e.m., and 
*P <0.05 compared to control group. Student’s t-test was used for single 
comparisons. NS, not significant. 


indirect mechanism. To investigate this, we treated cultured primary 
subcutaneous adipocytes with serum-free media conditioned by myo- 
cytes expressing PGC1-z or cells expressing green fluorescent protein 
(GFP). As shown in Fig. le, the media from cells expressing ectopic 
PGC1-a increased the mRNA levels of several brown-fat-specific genes 
(Fig. le). This suggested that PGC1-« causes the muscle cells to secrete 
a molecule(s) that can induce a thermogenic gene program in the cells. 


Candidate, secreted PGCl-a-dependent proteins 


We used a combination of Affymetrix-based gene expression arrays 
and an algorithm that predicts protein secretion to search for proteins 
that could mediate the browning of adipose tissues under the control 
of muscle PGC1-« (Methods). Proteins with mitochondrial targeting 
sequences were excluded, and all candidates were validated in gain-of- 
function systems for PGC1-« in vivo (Methods). Five proteins were 
identified as PGCl-« target genes in muscle and as likely to be 
secreted: IL-15, FNDC5, VEGF-B, LRG1 and TIMP4 (Fig. 2a). 
Conversely, expression of these genes was reduced in mice with 
muscle-specific deletion of PGC1-« (Supplementary Fig. 2). Furthermore, 
they were also found to be increased at the mRNA level in muscle from 
exercised mice (Fig. 2b). The expression of this same set of genes was 
also examined in muscle biopsies from human subjects before and after 
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Figure 2 | FNDCS5 is induced with forced PGC1-a expression or exercise, 
and turns on brown/beige fat gene expression. a, qPCR against indicated 
genes in skeletal muscle from MCK-PGC1-o transgenic mice or littermate 
controls (n = 7 from each group). b, qPCR against indicated genes in skeletal 
muscle from sedentary mice or mice given 3 weeks of free wheel running 

(n = 10 from each group). Mice were rested for 12 h before being killed. 

c, mRNA expression levels from human muscle biopsies before and after 10 
weeks of endurance exercise training (8 subjects included). All data points are 
normalized to baseline levels. d, SVF from the inguinal fat depot, differentiated 
into adipocytes for 6 days in the presence of saline or recombinant FNDC5 
(20 nM), IL-15 (10 pM) or VEGF-f (50 1M). The graph show normalized 
mRNA levels of indicated genes. This experiment was repeated three times with 
similar results. For d, we performed one-way ANOVA tests where *P < 0.05 for 
the effect of FNDC5 on Ucp1 and Cidea expression. All other statistics were 
performed using Student’s t-test, and bar graphs are mean + s.e.m. 


a controlled period of endurance exercise’ (Fig. 2c). FNDC5, VEGFB 
and TIMP4 mRNAs were all significantly induced in humans with 
exercise. IL-15 has previously been reported as being secreted from 
muscle under the influence of exercise®, while the regulation of 
FNDC5, VEGF-8, LRG1 and TIMP4 by exercise has not been 
described. FNDC5 mRNA and brown fat markers in subcutaneous 
fat were not regulated by acute exercise, and FNDC5 mRNA was not 
induced by exposure to cold (4 °C) for 6h (Supplementary Fig. 2). 


Fndc5 induces a browning in vitro 


Several of the proteins encoded by these genes were commercially 
available, so they were applied directly to primary subcutaneous white 
adipocytes during differentiation. Factors such as IL-15 or VEGF-B 
had minimal effects on the expression of Ucp1 and other brown fat 
genes at concentrations of 200 nM or higher. However, FNDC5 pro- 
moted a sevenfold induction of Ucp1 mRNA at a concentration of 
20 nM (Fig. 2d). The transcriptional changes induced by FNDC5 were 
addressed on a global scale using gene expression arrays (Supplemen- 
tary Fig. 3). Notably, Ucp1 and three other known brown fat genes, 
Elovl3, Cox7al and Otop1, were among the eight most upregulated 
genes (Supplementary Fig. 4). Conversely, genes characteristic of 
white fat development were downregulated, such as leptin (Sup- 
plementary Fig. 3). These data indicate that the activation of browning 
and thermogenic genes by FNDCS is a major part of the action of this 
polypeptide on these cells. 

The effects of FNDC5 treatment were remarkably robust; Ucp1 
mRNA was increased 7-500 fold in more than ten experiments using 
FNDC5 at a concentration of 20 nM (Fig. 3a and Supplementary Fig. 5). 
Moreover, we could demonstrate a clear dose-dependence, with an 
effective range between 20-200 ng ml! (1.5-15 nM) (Fig. 3b). In con- 
trast, BMP7, reported as a potent inducer of browning’, had a much 
smaller effect (maximum of twofold) on the same cells at 3.3 uM 
(Fig. 3a). 
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Figure 3 | FNDCS5 is a potent inducer of the brown/beige fat gene program 
a, SVF from the inguinal fat depot was differentiated into adipocytes for 6 days 
in the presence of saline, recombinant FNDC5 (20 nM) or BMP7 (3.3 1M). The 
graph show normalized mRNA levels for indicated genes. Similar results were 
obtained in more than 10 experiments with the fold induction of Ucp1 between 
7-500 fold. b, mRNA levels of Ucp1 from inguinal-derived SVF treated with 
FNDCS5 for 6 days at indicated doses. ¢, Clark electrode measurements of 
oxygen consumption in SVF from the inguinal fat depot, differentiated into 
adipocytes for 6 days in the presence of saline or recombinant FNDC5 (20 nM). 
Data are representative for three independent experiments and normalized to 
total cellular protein. d, qPCR of Ucp1 mRNA from SVF, differentiated into 
adipocytes, and treated with FNDC5 or saline for 6 days followed by addition of 
forskolin for 8 h. tP < 0.05 compared to forskolin treatment. e, qPCR of Ppara 
after 6 days of FNDCS5 treatment (20 nM) during differentiation of primary 
SVF. f, SVF differentiated into adipocytes and treated with FNDC5 and/or 
GW6471 for 6 days. The graph shows qPCR of indicated genes. +P < 0.05 
compared to FNDC5 treatment. For d and f, combined one- and two-way 
ANOVA was used. *P < 0.05. All other statistics were performed using 
Student’s t-test, and bar graphs are mean + s.e.m. 


We also used immunohistochemistry to study cells treated with 
FNDCS and observed a robust increase in UCP1-positive adipocytes 
with multilocular lipid droplets (Supplementary Fig. 4). Electron 
microscopic analysis of FNDC5-treated cells showed a higher density 
of mitochondria compared to control cells, consistent with a brown- 
fat-like phenotype and elevated mitochondrial gene expression 
(Supplementary Fig. 5). The sizes of mitochondria, however, were 
similar between groups (Supplementary Fig. 4). Lastly, measurements 
of oxygen consumption provided functional evidence of increased 
energy expenditure with FNDC5 exposure. Total oxygen consump- 
tion was greatly increased (100%) by 20nM of FNDCS5, and the 
majority of this respiration was uncoupled (Fig. 3c). Thus, FNDC5 
potently induces thermogenesis and a brown-fat-like gene program in 
cultured adipocytes. In marked contrast, FNDC5 showed little or no 
effects on the classical brown fat cells isolated from the interscapular 
depot (Supplementary Fig. 4). 

We sought to define the timeframe during the differentiation pro- 
cess when FNDC5 was effective. FNDC5 was applied to cells in 2-day 
windows from day 0-6, and this was compared to cells to which the 
protein was added during the entire 6-day differentiation process. As 


ARTICLE 


shown in Supplementary Fig. 5, treatment during days 3-6 was effec- 
tive at inducing Ucp1 mRNA, although notas effective as when FNDC5 
was present throughout the differentiation process. Furthermore, 
treatment during the initial 2 days had no effect on UCP1 levels, sug- 
gesting that FNDC5 acts mainly during the differentiation process of 
cells committed to the adipocyte lineage. Cyclic AMP (cAMP) is an 
important signalling pathway in thermogenesis, promoting the brown- 
fat gene program downstream of f-adrenergic stimulation. We 
therefore asked whether FNDCS5 effects were additive or redundant 
with cAMP signalling. As shown in Fig. 3d, FNDC5-exposed cells 
increase UCP1 expression in an additive manner when exposed to 
forskolin, an adenyl cyclase activator. Two-way ANOVA tests demon- 
strated that there was a significant (P< 0.01) interaction between 
FNDCS5 and forskolin treatment, indicating synergistic effects. 


PPAR-a acts downstream of Fndc5 


A key question is how FNDCS is able to stimulate a thermogenic gene 
program. One potentially important transcription factor induced by 
FNDCS5, identified using gene expression arrays, was PPAR-«. This 
nuclear receptor has been shown to drive Ucp1 expression and several 
other genes involved in browning of adipose cells'®. Ppara is increased 
threefold at the mRNA level by FNDCS treatment (Fig. 3e). Importantly, 
the FNDC5-mediated increase in UCP1 was significantly reduced when 
cells were simultaneously subjected to the selective PPAR-« antagonist 
GW6471 (Fig. 3f). The functional interaction between the FNDC5 and 
GW6471 treatments on UCP] expression was confirmed using two-way 
ANOVA (P < 0.05). Conversely, the PPAR-« antagonist normalized the 
reduction seen in white adipose genes leptin and adiponectin after 
FNDCS5 treatment. Together, these data indicate that FNDC5 acts to 
induce Ucp1 gene expression, at least in part, via PPAR-«. 


Irisin is a cleaved and secreted fragment of Fndc5 


FNDCS5 (also known as FRCP2 and PeP), was previously shown to 
have a signal peptide, two fibronectin domains and one hydrophobic 
domain that is likely to be membrane inserted’*’” (Fig. 4a). Previous 
studies did not investigate whether part of this protein might be 
secreted''*, With this structure in mind, we considered the possibility 
that FNDC5 might be synthesized as a type I membrane protein, 
followed by proteolytic cleavage and release of the amino (N)- 
terminal part of the protein into the extracellular space. Thus, any 
carboxy (C)-terminal or N-terminal tags would be lost during proces- 
sing of the mature protein or interfere with localization. Indeed, 
expression of a C-terminally Flag-tagged FNDC5 (Fig. 4a) did not 
result in any Flag immunoreactivity in the medium from cells expres- 
sing this construct (Fig. 4b). However, when we immunoblotted the 
same samples with an antibody that recognizes the endogenous 
FNDC5 protein, we could easily detect substantial amounts of 
FNDCS in the media at approximately 32 kilodalton (kDa): this is 
slightly larger than the cellular FNDC5 (Fig. 4b). These data indicate 
that FNDC5 is C-terminally cleaved, secreted and possibly further 
modified. 

Western blot of media fractions with antibodies against wild-type 
FNDCS5 showed multiple bands, suggestive of glycosylation, a common 
feature of secreted proteins. Treatment of supernatants from FNDC5- 
expressing cells with peptide N-glycosidase F (PNGase F) resulted in a 
significant size decrease as detected by SDS—polyacrylamide gel elec- 
trophoresis (SDS-PAGE), from 32 kDa to 20 kDa (Fig. 4c). 

Mass spectrometry was used to determine the sequence of the 
FNDC5-derived polypeptide found in the media (Methods). To do this, 
we fused the N terminus of FNDC5 (without the signal peptide) to the C 
terminus of the crystallisable fragment (Fc) domain of immunoglobulin 
G (IgG). After purification and enzymatic deglycosylation of the 
secreted material, mass spectrometry analyses indicated that secreted 
FNDC5 was truncated at glutamic acid 112 (not including the signal 
sequence), as shown in Fig. 4e. The secreted portion of FNDC5 has 
remarkable conservation between species, with 100% identity between 
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Figure 4 | FNDC5 is proteolytically cleaved and secreted from cells. 
a, Schematic representation of the FNDC5 protein structure (top), Flag-tagged 
FNDCS protein (middle) and irisin (bottom). C, C-terminal domain; 
H, hydrophobic domain; SP, signal peptide. b, HEK 293 cells transfected with a 
vector expressing the C-terminal Flag-tagged FNDC5 (CTE-F5, third panel 
from a), followed by isolation of cell and culture media protein. Samples were 
adjusted for protein content and western blots were performed against the Flag 
antigen (left) or FNDC5 (right). This was repeated in several experiments with 
similar results. Adjusting for volume (instead of protein content) also gave 
similar results. C, cell fraction; M, media fraction. Arrows indicate molecular 
weight. c, HEK 293 cells transfected with a vector expressing FNDC5-CTF, 
followed by isolation of cell and media protein. Respective protein fraction was 
then treated with PNGase F followed by western blot against FNDCS5 after 
SDS-PAGE. d, Representation of the full-length FNDC5 and the irisin 
fragment mapped by mass spectrometry (underlined). 


mice and humans (Supplementary Fig. 6). Because this distinct, 
secreted polypeptide has not been previously described and signals 
from muscle to other tissues we named it irisin, after Iris, the Greek 
messenger goddess. 

The ability of the anti-FNDC5 antibodies to react with irisin allowed 
us to investigate the contribution made by irisin to the browning 
activity caused by muscle cells expressing PGC1-«. Media conditioned 
by muscle cells that had forced expression of PGC1-« were incubated 
with control or anti-FNDCS5 antibodies before they were applied to the 
fat cell cultures. As shown in Supplementary Fig. 6, the FNDC5 
antibody caused a marked reduction in the ability of the PGCl-o 
conditioned media to induce Ucp1 and Cidea mRNA in the primary 
inguinal cells. This suggests that irisin accounts for a significant frac- 
tion of this activity found in secreted media from muscle cells with 
forced PGC1-a expression. We cannot, however, exclude the possibility 
that other factors might also contribute to this response. 


Irisin is present in mouse and human plasma 


We next analysed levels of irisin in plasma from wild-type mice, using 
intravenous adenoviral delivery of full-length FNDC5 as a positive 
control. This method results in strong forced expression from the liver 
and potential secretion to the plasma, where we detected irisin using 
western blot after albumin/IgG-depletion and deglycosylation. As 
seen in Fig. 5a, FNDC5-expressing virus resulted in a clear increase 
in an immunoreactive band at 22 kDa. Importantly, this was the only 
band altered on these blots. Using western blots of purified FNDC5 
protein as a quantitative standard, irisin is present in the plasma of 
control mice at a concentration of approximately 40nM. We also 
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Figure 5 | Detection of irisin in mouse and human plasma. a, Plasma from 
mice injected intravenously with adenoviral vectors (Ad) expressing FNDCS5 or 
GFP was subjected to western blot against FNDCS. b, Western blot against 
irisin in plasma from muscle-specific PGC1-«% knockout (MKO) mice or 
control littermates (flox/flox). WT, wild type. c, Western blot against irisin in 
plasma from wild-type mice or two healthy human subjects (representative for 
8 subjects analysed identically). d, Western blot against irisin in serum from 
control or 3 weeks exercised mice, followed by 12h rest. Bottom panel shows 
quantification of the bands. e, Western blot analysis of irisin in plasma from 
human subjects before and after 10 weeks of endurance exercise. Eight subjects 
in total were analysed; quantification after internal normalization is displayed 
in bottom panel. For all plasma analyses, samples were depleted for albumin/ 
IgG, and deglycosylated using PNGase F. Arrow indicated irisin band. Data are 
presented as mean + s.e.m., and *P < 0.05 compared to control group. 
Student’s t-test was used for single comparisons. 


analysed plasma of PGC1-o muscle-specific knockout mice as a nega- 
tive control, and the irisin band at 22 kDa was decreased by 72% in 
these animals (Fig. 5b). Furthermore, an immunoreactive band of 
identical electrophoretic mobility was found in plasma from healthy 
human subjects (Fig. 5c). This band was greatly diminished when the 
anti-FNDCS5 antibody was neutralized with an excess of antigen 
(Supplementary Fig. 7). 

We examined blood levels of irisin after exercise in mice and 
human subjects. Mice had significantly elevated (65%) plasma con- 
centrations of irisin after they were subjected to 3 weeks of free wheel 
running (Fig. 5d). Similar analyses in healthy adult humans subjected 
to supervised endurance exercise training for 10 weeks revealed a 
twofold increase in the circulating irisin levels compared to the 
non-exercised state (Fig. 5e). Thus, irisin is present in mouse and 
human plasma, and is increased with exercise. The increase in cir- 
culating protein in both species is roughly proportional to the 
increases observed at the mRNA level in muscle (Fig. 2c). 


Irisin reduces obesity and insulin resistance 

We used adenoviral vectors to express full-length FNDC5 (ora control 
GFP) and examined its biological and therapeutic effects. This method 
resulted in a 15-fold increase in liver Fndc5 mRNA, although the liver 
expresses very low endogenous levels of this mRNA. Plasma levels of 
irisin were increased 3-4 fold (Fig. 5a). The mice did not display any 
adverse reaction, and upon gross pathological examination, there was 
no apparent toxicity in any major organ system. There was also no 
increase in plasma AST levels, and inflammatory genes were not sig- 
nificantly altered in the liver when the two groups were compared 
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(Supplementary Fig. 8). Ten days after injection, Ucp1 mRNA was 
increased by 13-fold in the subcutaneous depot relative to the same 
depot in mice receiving the virus expressing GFP (Fig. 6a, b); Cidea was 
also significantly upregulated (Fig. 6a). There were no changes in 
expression of UCP1 in the interscapular classical brown fat (BAT), 
but we did observe a minor elevation in Prdm16 and Pgcla mRNA 
(Supplementary Fig. 8). The changes in gene expression in the sub- 
cutaneous adipose tissues were accompanied by a clear increase in the 
number of UCP1-positive, multilocular adipocytes (Fig. 6c). We did 
not, however, detect any change in body weight in the GFP versus 
FNDC5 groups of animals. We observed similar results in young 
C57BL/6 mice (Supplementary Fig. 9). Thus, moderate increases in 
circulating irisin can induce browning of white adipose tissues in vivo, 
including increased expression of UCP1. 

As activation of the classical brown fat or browning of white fat has 
been shown to improve obesity and glucose homeostasis in vivo’, we 
delivered FNDC5-expressing adenovirus to mice rendered obese and 
insulin-resistance by feeding a high fat diet. We chose C57BL/6 mice 
for these experiments because they are highly prone to diet-induced 
obesity and diabetes. The expression of irisin increased Ucp1 gene 
expression to the same degree as in lean mice (Supplementary Fig. 9). 
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Figure 6 | Irisin induces browning of white adipose tissues in vivo and 
protects against diet-induced obesity and diabetes. a—c, Wild-type BALB/c 
mice were injected with 10'° GEP- or FNDC5-expressing adenoviral particles 
intravenously (n = 7 for each group). a, b, Animals were killed after 10 days and 
inguinal/subcutaneous fat pads were collected and analysed using qPCR 
analysis of indicated mRNAs (a) and western blot against UCP1 

(b). c, Representative images from immunohistochemistry against UCP1 in 
these mice. All results in a-c were repeated two times with similar results. 
d-g, C57BL/6 mice fed a 60% kcal high-fat diet for 20 weeks were intravenously 
injected with GFP- or FNDC5-expressing adenovirus and all analyses were 
done 10 days thereafter (n = 7 for both groups). d, Oxygen consumption at day 
and night. e, Body weights of mice 10 days after injection with indicated 
adenovirus. f, Fasting plasma insulin measured using enzyme-linked 
immunosorbent assay (ELISA). g, Intraperitoneal glucose tolerance test. 

h, Mice were injected intraperitoneally with 50 tg of rabbit IgG or a rabbit anti- 
FNDC5 antibody (ab) and were either subjected to swimming for 7 days or kept 
sedentary (n = 10 for all groups). Data show mRNA expression levels from 
inguinal white adipose tissue. All data in d—j were performed at least twice in a 
separate mouse cohort with similar results. +P < 0.05 compared to exercise and 
IgG. One-way ANOVA was used for statistics in h. All other statistics were 
performed using Student’s t-test, and bar graphs are mean ~ s.e.m. 
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There was also an elevation in expression of several mitochondrial 
genes (Supplementary Fig. 8). Notably, these changes occurred with 
only moderately increased irisin blood levels, threefold compared to the 
GFP-expressing mice. This effect was accompanied with a large 
increase in oxygen consumption (Fig. 6d), consistent with the gene 
expression data, and body weights of the irisin-expressing mice were 
slightly reduced after 10 days compared to GFP-expressing controls 
(Fig. 6e). These effects of irisin on mitochondrial gene expression in the 
fat were not seen in skeletal muscle in vivo or in cultured myocytes 
(Supplementary Fig. 10). Irisin expression in the mice fed a high fat diet 
caused a significant improvement in glucose tolerance when compared 
to the control mice expressing GFP. In addition, fasting insulin was also 
reduced (Fig. 6f, g). These data illustrate that even moderately increased 
levels of circulating irisin potently increase energy expenditure, reduce 
body weight and improve diet-induced insulin resistance. 

Finally, we asked whether irisin is required for the exercise-induced 
effects on the subcutaneous white fat. Injection of anti-FNDC5 
antibodies into mice before 10 days of swim training dramatically 
reduced the effect of this exercise on Ucp1 and Cidea gene expression, 
compared to injection of control antibodies (Fig. 6h). In contrast, 
Prdm16 mRNA levels were not increased with exercise and were also 
not affected by the anti- FNDC5 antibodies. Thus, irisin is required for 
a substantial part of the effect of exercise on these gene expression 
events in the browning of white fat. 


Discussion 


Exercise has the capacity to improve metabolic status in obesity and type 
2 diabetes, but the mechanisms are poorly understood. Importantly, 
exercise increases whole body energy expenditure beyond the calories 
used in the actual work performed"*. However, the relative contribution 
of the adipose tissues to this phenomenon has not been clarified. 
Because transgenic mice expressing PGCl-« selectively in muscle 
showed a remarkable resistance to age-related obesity and diabetes’, 
we sought factors secreted from muscle under the control of this co- 
activator that might increase whole body energy expenditure. We 
describe a new polypeptide hormone, irisin, which is regulated by 
PGC1-z, secreted from muscle into blood and activates thermogenic 
function in adipose tissues. Irisin is remarkable in several respects. First, 
it has powerful effects on the browning of certain white adipose tissues, 
both in culture and in vivo. Nanomolar levels of this protein increase 
UCP1 in cultures of primary white fat cells by 50 fold or more, resulting 
in increased respiration. Perhaps more remarkable, viral delivery of 
irisin that causes only a moderate increase (~3 fold) in circulating levels 
stimulates a 10-20 fold increase in UCP1, increased energy expenditure 
and an improvement in the glucose tolerance of mice fed a high fat diet. 
As this is in the range of increases seen with exercise in mouse and man, 
it is likely that irisin is responsible for at least some of the beneficial 
effects of exercise on the browning of adipose tissues and increases in 
energy expenditure. It is important to note that the evidence provided 
here does not exclude a role for other tissues besides muscle in the 
biological regulation and secretion of irisin. 

Second, the cleaved and secreted portion of FNDC5, the hormone 
irisin, is highly conserved in all mammalian species sequenced. Mouse 
and human irisin are 100% identical, compared to 85% identity for 
insulin, 90% for glucagon and 83% identity for leptin. This certainly 
implies a highly conserved function that is likely to be mediated by a 
cell surface receptor. The identity of such a receptor is not yet known. 

On the basis of the gene structure of Fndc5, with a signal peptide that 
was evidently missed in previous studies’’, we considered that FNDC5 
might be a secreted protein. Indeed, we have observed that the signal 
peptide is removed, and the mature protein is further proteolytically 
cleaved and glycosylated, to release the 112-amino-acid polypeptide 
irisin. The cleavage and secretion of irisin is similar to the release/ 
shedding of other transmembrane polypeptide hormones and hormone- 
like molecules such as epidermal growth factor (EGF) and transforming 
growth factor- (TGF-«). 
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As the conservation of calories would probably provide an overall 
survival advantage for mammals, it seems paradoxical that exercise 
would stimulate the secretion of a polypeptide hormone that increases 
thermogenesis and energy expenditure. One explanation for increased 
irisin expression with exercise in mouse and man may be that it evolved 
as a consequence of muscle contraction during shivering. Muscle 
secretion of a hormone that activates adipose thermogenesis during 
this process might provide a broader, more robust defence against 
hypothermia. 

The therapeutic potential of irisin is obvious. Exogenously adminis- 
tered irisin induces the browning of subcutaneous fat and thermogenesis, 
and it presumably could be prepared and delivered as an injectable 
polypeptide. Increased formation of brown or beige/brite fat has been 
shown to have anti-obesity, antidiabetic effects in multiple murine 
models’, and adult humans have significant deposits of UCP 1-positive 
brown fat’. Data presented here show that even relatively short treat- 
ments of obese mice with irisin improves glucose homeostasis and 
causes a small weight loss. Whether longer treatments with irisin 
and/or higher doses would cause more weight loss remains to be deter- 
mined. The worldwide, explosive increase in obesity and diabetes 
renders attractive the therapeutic potential of irisin in these and related 
disorders. 

Another potentially important aspect of this work relates to other 
beneficial effects of exercise, especially in some diseases for which no 
effective treatments exist. The clinical data linking exercise with 
health benefits in many other diseases suggests that irisin could also 
have significant effects in these disorders. 


METHODS SUMMARY 

Primary mouse stromal vascular fractions from adipose tissues were differentiated 
as described®. FNDC5/irisin were purchased from ABNOVA (GST fused), or 
produced from Syd Laboratories (GST fused) or LakePharma (Fc fusions). 
Hydrodynamic injections’®, electron microscopy’’ and treadmil running'* were 
performed as previously described. Unless otherwise stated, bar graph data are 
presented as mean + s.e.m., and *P < 0.05 compared to control group. Student’s 
t-test was used for single comparisons and one-way ANOVA for multiple. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Materials. Antibodies against UCP1, tubulin and FNDC5 were from Abcam. 
Forskolin, insulin, dexamethasone, rosliglitazone, GW6471 and antibody against 
Flag were from Sigma. Primers for all qPCR experiments are listed in Supplemen- 
tary Table 1. Recombinant FNDCS5, LRGI, IL-15, VEGF-B and TIMP4 were from 
ABNOVA (Taiwan). Coomassie staining kit and Lipofectamine 2000 were from 
Invitrogen. 

Identification of PGC1-a-dependent secreted proteins. All PGC1-«-induced 
genes as judged from gene expression analysis in MCK-PGC1-o muscle with a 
fold change of at least 2 and P < 0.05 were subjected to the following analysis. The 
protein sequence of the longest transcript were analysed in the SignalP-software”” 
(http://www.cbs.dtu.dk/services/SignalP/). Sequences with positive S, C, Y and D 
scores were considered positive for a signal sequence. All positive proteins were 
then screened for mitochondrial target sequences (http://www.cbs.dtu.dk/ 
services/TargetP/) whereas positive sequences were removed. All remaining hit 
proteins were then analysed using qPCR in muscle from MCK-PGC1-a mice and 
myocytes overexpressing PGC1-a. 

Primary cell cultures and recombinant protein treatments. The SVF from 
inguinal fat depots of 8-12-week-old BALB/C mice were prepared and differen- 
tiated for 6 days as previously described”’. Rosiglitazone was used for the two first 
days of differentiation. For all experiments, unless otherwise indicated, recom- 
binant FNDCS was added to the culture media at a concentration of 1 g ml’ for 
the last 4 days of differentiation. Primary myoblasts were cultured and differen- 
tiated as described previously”. 

Preparation of protein fractions from cells and media. 293HEK or primary 
myocytes were transfected by standard methods or transduced with adenovirus at 
a multiplicity of infection (m.o.i.) of 20 as indicated. Twenty-four hours after 
transfection, media was removed, and cells were washed in large volumes of PBS 
five times, followed by incubation in Freestyle serum-free media (GIBCO) for 
24h. The cells and media were then collected separately, and media were cen- 
trifuged three times at 2,000g to pellet debris. Thereafter, a quarter volume of ice- 
cold TCA was added and precipitated protein was pelleted at 6,000g and washed 
three times in acetone. Pellet was then dried and resuspended in SDS-containing 
lysis buffer. Protein concentration was measured in both cell and media fraction 
and adjusted either by protein or volume as indicated. 

RT-PCR. qPCR was carried out after Trizol-based RNA extraction using 
RNAeasy (Invitrogen) and thereafter SYBR green. All data were normalized to 
TBP, 18S or indicated in-house genes and quantitative measures were obtained 
using the AAC; method. 

Western blot and quantification. Protein amounts from all samples were 
assessed using the BCA-kit (Thermo Scientific) followed by protein concentration 
normalization before all western blot experiments. Western blot was carried out 
following standard procedure and final band intensity (QL-BG) was quantified 
using BioPix iQ’. All data were normalized to background and loading controls. 
Additional methods. CLARK electrode measurements, energy expenditure in 
vivo, interaperitoneal glucose tolerance test (IGTT) and immunohistochemistry 
against UCP1 were performed as described previously®, with the exception that 
CLARK output was normalized to total cell protein. Fc-fusion construction and 
protein purification was performed by LakePharma. 

Comprehensive laboratory animal monitoring system. C57/B16J mice were fed 
a high fat (60% kcal) diet (D12492, Research Diets) for 20 weeks, starting at 6 
weeks of age. Mice were then injected with indicated doses of adenovirus expres- 
sing GFP or FNDCS, and comprehensive laboratory animal monitoring system 
(CLAMS; Columbia Instruments)-cage analysis was performed as described 
previously®. Briefly, mice were acclimated in metabolic chambers for 2 days 
before analysis in order to minimize stress. CO and O, levels were then collected 
every 36 min for a period of 3 days. Data on activity, heat generation and food 
intake were measured at more frequent intervals. Circadian rhythm was observed 
for most parameters. Data were not normalized to body weights unless otherwise 
stated. 

Mass spectrometry and peptide fingerprinting. Gel bands were digested with 
sequencing grade trypsin (Promega) or ASP-N (Sigma-Aldrich) as per manufac- 
turer’s instructions. Extracted in-gel protein digests were resuspended in 8 11 5% 
formic acid/5% acetonitrile, and 4 ul were analysed by microcapillary liquid 
chromatography electrospray ionization tandem mass spectrometry (LC-MS/ 
MS). Analyses were done on a LTQ Orbitrap Velos mass spectrometer 
(Thermo Fisher Scientific) equipped with a Thermo Fisher Scientific nanospray 
source, an Agilent 1100 Series binary HPLC pump and a Famos autosampler. 
Peptides were separated on a 100m X 16cm fused silica microcapillary column 
with an in-house made needle tip. The column was packed with MagicC18AQ 
Cig reversed-phase resin (particle size, 541m; pore size, 200 A; Michrom 
Bioresources). Separation was achieved through applying a 30 min gradient from 
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0-28% acetonitrile in 0.125% formic acid. The mass spectrometer was operated in 
a data-dependent mode essentially as described previously”* with a full MS scan 
acquired with the Orbitrap, followed by up to ten LTQ MS/MS spectra on the 
most abundant ions detected in the MS scan. Mass spectrometer settings were: full 
MS (automated gain control, 1 X 10°; resolution, 6 X 104; m/z range, 375-1,500; 
maximum ion time, 1,000 ms); MS/MS (AGC, 5 X 10°; maximum ion time, 
120 ms; minimum signal threshold, 4 x 10°; isolation width, 2 Da; dynamic 
exclusion time setting, 30 s). After MS data acquisition, RAW files were converted 
into mzXML format and processed using a suite of software tools developed in- 
house for analysis. All precursors selected for MS/MS fragmentation were 
confirmed using algorithms to detect and correct errors in monoisotopic peak 
assignment and refine precursor ion mass measurements. All MS/MS spectra were 
then exported as individual DTA files and searched with no enzyme using the 
Sequest algorithm. These spectra were then searched against a database containing 
sequence of mouse FNDCS in both forward and reversed orientations. The fol- 
lowing parameters were selected to identify FNDC5: 10 p.p.m. precursor mass 
tolerance, 0.8 Da product ion mass tolerance, fully tryptic or ASP-N digestion, 
and up to two missed cleavages. Variable modifications were set for methionine 
(+15.994915). In addition, a fixed modification for the carbamidomethylation for 
cysteine (+57.021464) was used as well. The C-terminal fragment for FNDC5 was 
identified (KDEVTMKE) by trypsin digestion and reconfirmed by a separate 
ASP-N digestion. 

Preparation of plasma samples for western blot. Thirty-five microlitres of 
mouse or human plasma were precleared for albumin/IgG using the 
ProteoExtract-kit (CalBiochem) as recommended by the manufacturer. 
Samples were then concentrated to approximately 100 ul and >8 pg pl’, fol- 
lowed by deglycosylation of 150 ug using PNGase F (New England Biolabs). 
Totally, 80 ul were then prepared containing 1X sample buffer with reducing 
agent and 1.7 1g yl protein, sonicated, boiled and analysed using western blot 
against FNDC5 or indicated antibody. 

Construction of adenoviral vectors. The FNDC5 expression vector was pur- 
chased with a C-terminal Flag-tag from OriGene. The QuickChange Multi Site 
XL Directed Mutagenesis Kit (Aligent Technologies) was used to introduce a Flag 
tag downstream of the signal sequence and to mutate the C-terminal Flag tag, thus 
resulting in the N-terminal Flag (NTF)-FNDCS5 construct (Fig. 5a). The NTF and 
CTF FNDC5 constructs were then subcloned into the pENTRla vector 
(Invitrogen) and recombined into the pAd-CMV-DEST-V5 vector (Invitrogen) 
and adenovirus was produced using the virapower system (Invitrogen), including 
three rounds of amplification. Thereafter, virus was concentrated using the 
Vivapure adenopack 100 (Sartorius Stedim Biotech) and buffer exchanges to 
saline reaching a concentration of 9-10i.f.u. pl’. A GEP-containing adenovirus 
previously used was prepared in parallel. 

Transgenic mice. The MCK-PGCl1-o transgenic and muscle-specific PGC1-« 
knockout mice have been described previously’. 

Exercise protocols. Twelve-week-old B6 mice were exercised either using 
swimming” or using free wheel running, as described previously”®. 

Human material and exercise training program. Blood samples and skeletal 
muscle biopsies were obtained from eight male non-diabetic individuals before 
and after 10 weeks of aerobic training as described previously’. In brief, the 
exercise-training program consisted of cycling on stationary bikes with 4—5 sessions 
of 20-35 min per week at an average intensity of ~65% of maximal oxygen con- 
sumption. Informed consent was obtained from all volunteers before participation. 
The study was approved by the Local Ethics Committee and was performed in 
accordance with the Helsinki Declaration. 
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X-ray structures of LeuT in substrate- 
free outward-open and apo inward-open 
States 


Harini Krishnamurthy! & Eric Gouaux!? 


Neurotransmitter sodium symporters are integral membrane proteins that remove chemical transmitters from the 
synapse and terminate neurotransmission mediated by serotonin, dopamine, noradrenaline, glycine and GABA 
(y-aminobutyric acid). Crystal structures of the bacterial homologue, LeuT, in substrate-bound outward-occluded 
and competitive inhibitor-bound outward-facing states have advanced our mechanistic understanding of 
neurotransmitter sodium symporters but have left fundamental questions unanswered. Here we report crystal 
structures of LeuT mutants in complexes with conformation-specific antibody fragments in the outward-open and 
inward-open states. In the absence of substrate but in the presence of sodium the transporter is outward-open, 
illustrating how the binding of substrate closes the extracellular gate through local conformational changes: 
hinge-bending movements of the extracellular halves of transmembrane domains 1, 2 and 6, together with 
translation of extracellular loop 4. The inward-open conformation, by contrast, involves large-scale conformational 
changes, including a reorientation of transmembrane domains 1, 2, 5, 6 and 7, a marked hinge bending of 
transmembrane domain 1a and occlusion of the extracellular vestibule by extracellular loop 4. These changes close the 
extracellular gate, open an intracellular vestibule, and largely disrupt the two sodium sites, thus providing a mechanism 
by which ions and substrate are released to the cytoplasm. The new structures establish a structural framework for the 


mechanism of neurotransmitter sodium symporters and their modulation by therapeutic and illicit substances. 


Chemical neurotransmission in the central nervous system is termi- 
nated through re-uptake of neurotransmitters from the synapse into 
surrounding neuronal and glial cells, a process first characterized in 
1961 (ref. 1). Transmitter uptake is mediated by neurotransmitter 
sodium symporters (NSSs)**, integral membrane proteins that exploit 
energetically favourable Na* electrochemical gradients for the thermo- 
dynamically uphill transport of neurotransmitters. Members of the NSS 
family include transporters for GABA, glycine, noradrenaline, serotonin 
and dopamine—chemical transmitters that have fundamental roles 
in the function of the nervous system. Accordingly, dysfunction of 
these transporters is implicated in diseases such as depression*”, 
schizophrenia’, epilepsy® and Parkinson’s disease’, and they are targets 
for therapeutic drugs including tricyclic antidepressants (TCAs) and 
selective serotonin re-uptake inhibitors (SSRIs) as well as addictive sub- 
stances such as cocaine and amphetamines’. Understanding the sub- 
strate translocation mechanism of NSSs is central to the development of 
accurate models of substrate and drug complexes and to the discovery of 
new therapeutic agents. 

The mechanism of NSS transport is generally described by the 
thermodynamically coupled binding of substrate and ion(s) to a central 
binding site that is alternately accessible to either side of the mem- 
brane*? (Supplementary Fig. 1). Crystal structures of LeuT, a bacterial 
NSS homologue, elucidated the architecture of NSS proteins, demon- 
strated the existence ofa substrate- and ion-bound occluded conforma- 
tion, and illustrated how competitive and non-competitive inhibitors 
stabilize an outward-facing conformation’®"*. LeuT is, at present, the 
best template for modelling the structure of NSSs and their complexes 
with therapeutic and illicit drugs. However, our understanding of 


mechanism and structure/function relationships in NSSs is incomplete 
due to the absence of LeuT structures in outward-open and inward- 
open states. 

In the absence of structural knowledge of transporter intermediates, 
general mechanisms of transport have been proposed based on structures 
of other secondary transporters bearing the LeuT fold and crystallized in 
distinct conformational states'*’’. However, low amino acid sequence 
identity compromises the extent to which this approach can generate a 
detailed and accurate mechanism for NSSs. Concomitantly, spectro- 
scopic and computational methods have focused on characterization of 
conformational changes accompanying substrate and ion association/ 
dissociation events in LeuT*’. Although these approaches have yielded 
new insights into localized movements, they have not revealed the three- 
dimensional, atomic-level details of conformational changes associated 
with substrate binding, isomerization of the transporter to the inward- 
open conformation, and release of substrate and sodium ion(s). Here, we 
present crystal structures of the outward-open and inward-open states of 
LeuT and thereby establish the structural basis for transport in the NSS 
family and provide accurate templates for modelling eukaryotic NSSs 
and their complexes with substrates, ions and drugs. 


Substrate-free and inward-open conformations 

To stabilize the substrate-free and inward-open states of LeuT, we 
mutated residues in transmembrane domains (TMs) 3 and 8, helices 
comprising part of the ‘scaffold’ domain (TMs 3, 4, 8 and 9) and in 
TM6, one of the ‘core’ domain helices (TMs 1, 2, 6 and 7)!%??4 in 
the background of the wild-type-like K288A variant (LeuT’)> 
(Supplementary Table 1 and Supplementary Fig. 2). To enhance 
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crystallization behaviour, we raised conformation-specific antibodies, 
exploiting fluorescence-detection size-exclusion chromatography 
(FSEC)”** to select the antibodies and to demonstrate state-dependent 
binding. Well-diffracting crystals of substrate-free LeuT were 
obtained using the Tyr 108 to Phe mutant in TM3 (LeuT*(Y108F))5 
and the 2B12 antibody fragment (Fab). Stabilization of LeuT in an 
inward-open conformation required weakening of the sodium 2 
(Na2) site’? by mutation of Thr 354 to Val and Ser 355 to Ala (TM8), 
as well as the cytoplasmic gate by changing Tyr 268 to Ala (TM6; 
LeuT<(TSY))*32728, together with the 6A10 Fab. 


Sodium -bound state is outward-open 


The substrate-free LeuT“(Y108F) structure (Supplementary Table 2 
and Supplementary Fig. 3) adopts an outward-open conformation as 
a consequence of hinge-like movements in TMs 1b, 2a (residues 41- 
55), and 6a, relative to the outward-occluded state (Fig. la-c, e). 
Extracellular loop 3 (EL3) and TM11 are displaced by as much as 
2.8A and 2.2 A, respectively, due to the movements of TM1b and 
TMo6a (Fig. 1a). Notably, TMs 1b, 2a and 6a pivot at Val 23, Gly 55 
and Leu 257, respectively, suggesting that when substrate no longer 
forges interactions between the core and scaffold domains, con- 
straints on TM1b and TMéa are released, allowing them to move 
outward and the transporter to adopt an outward-open conformation. 
The LeuT*(Y108F) structure is similar to the previously reported Trp- 
bound LeuT structure’ with a root mean squared deviation (r.m.s.d.) 
value of 0.4 A for all Cou atoms (Supplementary Fig. 4), supporting the 


Out 


Figure 1 | The substrate-free, Na*-bound state is outward-open. 

a, Superposition of the outward-open and leucine-bound outward-occluded 
conformations. Outward-open structure is coloured with Na” ions as purple 
spheres. The outward-occluded structure with Na“ ions (spheres) is grey and 
leucine is shown in stick representation. b, Schematic of scaffold and core 
domains, EL4, the pivot points of hinge movements in TMs 1 and 6 (solid black 
circles) and the substrate (S) and sodium sites (+). c, Superposition, as in panel 
a, illustrating that a ~9° rotation about an axis passing through the middle of 
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general principle that inhibitors bind to conformational states popu- 
lated by the wild-type transporter. 

The ‘thin’ extracellular gate’’ of the outward-occluded state is 
ruptured in the LeuT(Y108F) structure. Residues that bridge 
TMI1b/TMé6a to TM3/TM10 in the outward-occluded state have 
separated, opening a pathway to the extracellular solution (Fig. 1d, 
e). Arg 30 no longer forms a water-mediated salt bridge with Asp 404 
(TM10) and the side chain of Phe 253 has rotated away from the 
binding site by a ~90° rotation about the x1 dihedral angle, in agree- 
ment with molecular dynamics simulations*’”’. The coordinated 
movement of Arg 30 and Phe 253 enables the retention ofan important 
cation-7 interaction between their respective guanidinium and phenyl 
groups. The phenyl ring of residue 253 now occupies the same position 
as the indole ring of Trp 602 bound to the extracellular vestibule in the 
Trp-bound LeuT complex’. 

Prominent electron density peaks (4.00) observed in the Nal and 
Na? sites together with ion-oxygen distances of ~ 2.4 A (ref. 30) are 
consistent with sodium occupancy of these sites, although higher 
resolution diffraction data will be required to confirm their identity 
(Fig. 2). Most importantly, the outward-open structure suggests that 
the presence of sodium ions keeps the intracellular thick gate’’ closed 
by bridging interactions between the intracellular halves of the core 
and scaffold domains. Even though the Nal site is located close to the 
pivot points for TM1b and TMéa (Fig. 2a), the concerted movement 
of the helices allows for retention of ion coordination geometry, 
excepting the loss of the carboxylate oxygen from the absent substrate. 


Scaffold 
domain 


Core 
domain 


the core domain (yellow arrow) describes the conformational change associated 
with opening to the outside. Pivot points are shown as in panel b. d, Rupture of 
extracellular gate interactions (grey dashed lines) in the outward-open 
structure. Two water molecules that bridge Arg 30 and Asp 404 in the outward- 
occluded state are shown as red spheres. e, Surface representation of the 
outward-open structure with the zig-zag pink line indicating a closed 
intracellular pathway. Leucine, where shown, is from the outward-occluded 
Leu-bound structure. 


©2012 Macmillan Publishers Limited. All rights reserved 


Figure 2 | Sodium sites in the outward-open state. a, The Nal site showing 
positions of the coordinating residues within the framework of global changes 
in the outward-open structure. b, The Na2 site. F, — F, omit density is 
contoured at 40 and represented as green mesh. Colour scheme and 
representations are the same as in Fig. 1. Dashed lines indicate interactions 
between sodium ions and coordinating atoms with distances in A, solid black 
circles are approximate pivot points for hinge movement of helices and red and 
green cylinders define the TM1 and six helix axes, respectively. 


Binding of Na™ at this site may precede substrate binding, as also 
suggested by simulation studies”, thereby stabilizing local conforma- 
tions of TMs 1 and 6 and engaging the main-chain carbonyl oxygen of 
Ala22 (TM1la) with side-chain oxygens of Asn27 (TM1b) and 
Thr 254 (TM6a). The Na2 site is located towards the intracellular 
region of TM1, stitching TMla to TM8 through the main-chain 
carbonyls of Gly 20 and Val 23 (Fig. 2b). Thus, the Na” ions, through 
their interaction with TM1a, stabilize an intracellular-closed con- 
formation, a finding supported by single-molecule Férster resonance 
energy transfer (FRET) studies”’. 


Structure of the inward-open conformation 

We hypothesized that weakening of the Na2 binding site by mutation 
of Thr 354 and Ser 355 to valine and alanine, respectively, and per- 
turbation of the intracellular gate by mutation of Tyr 268 to Ala”, 
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together yielding the LeuT\(TSY) variant, would shift the conforma- 
tional equilibrium of LeuT towards the inward-open state (Sup- 
plementary Discussion and Supplementary Figs 2, 5, 8 and 9). We 
proceeded to crystallize the LeuT“(TSY)-6A10 Fab complex in the 
presence of lipids and solved the structure by molecular replacement 
at 3.2A resolution (Supplementary Table 2 and Supplementary 
Figs 10 and 11). Electron density for LeuT residues 1-10 was not 
observed and that for amino acids 11-24 was weak (mean B factor of 
158.0 A? versus 121.2 A? for the rest of LeuT). The Fab fragment binds 
on the intracellular side of LeuT<(TSY) (Supplementary Fig. 11). 

LeuTS(TSY) adopts an inward-open conformation as a con- 
sequence of large hinge-like movements within the core domain 
relative to the scaffold domain, and shifts in extracellular loops 
(Figs 3a-c). Perhaps the most notable change involves TM1a. Tilted 
by ~45° from its position in the closed state, it protrudes into the 
predicted location of the membrane. TM6b, in comparison, is rotated 
away from the central binding site by only 17°. In contrast to the 
uncoupled movements of TMs la and 6b, TMs 1b and 6a tilt by a 
similar extent (24° and 21°, respectively) towards the scaffold domain, 
blocking the extracellular pathway. Because TM1 and TM6 do not 
move as a unit, we superimposed the inward-open and outward- 
occluded structures using only the scaffold domain, resulting in an 
r.m.s.d. in Ca positions of 3.0 A. Helices buttressing TM1 and TM6, 
namely TM2, TM7 and TMS, also undergo substantial rearrange- 
ments but their extracellular and intracellular halves move to signifi- 
cantly different extents (Fig. 3c). Thus, these helices bend, rather than 
tilt as rigid bodies, facilitated by either a glycine or proline residue 
located in their midsections. Movement of TM7 causes EL4 to dip 
down further into the extracellular vestibule, thus closing off the 
extracellular solvent pathway. To make room for TM6a, TM11 moves 
away from the centre of the transporter, a direction opposite to that 
seen in the outward-open state (Fig. 3b). 


Access to the substrate-binding site 


In the inward-open LeuT structure, formation of a thick extracellular 
gate cuts off solvent access from the extracellular side while the thick 


Figure 3 | Inward-open conformation. a-c, Superposition of inward-open 
and outward-occluded state structures using the scaffold domain. The overall 
changes shown in b are divided in two parts, a and ¢, for clarity. The extent of 
rotation for the key TMs between the outward-occluded and inward-open 
conformations are indicated in a and c. The axis of rotation of core domain with 
the exclusion of TM1a is depicted in yellow. d, Surface representation of the 


inward-open structure looking ‘up’ into the binding site from the intracellular 
side, as depicted in e. f, Surface representation of the inward-open structure 
showing the elements forming the extracellular thick gate. Colour coding and 
representations are as in Fig. 1. Leucine and sodium, where shown, are from the 
outward-occluded Leu-bound structure. 
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intracellular gate opens, allowing access to the substrate binding site 
from the inside (Figs 3d—f). Consistent with cysteine accessibility 
studies on the serotonin (SERT)”**!*? and GABA (GAT-1)*? trans- 
porters, TM1a, TM5 and TM8 line the intracellular cavity. In particular, 
residues Leu 14 and Ala 19 of TM1a are solvent accessible, in line with 
the marked increase in reactivity observed for the corresponding 
residues in SERT in the inward-open conformation”. Accessibility 
and cross-linking studies also indicate that TM1b and TM6a move 
closer to TM3 (ref. 34) and, together with EL4, become inaccessible 
in the absence of sodium****. Consistent with its essential role in sub- 
strate transport’, EL4 occludes the extracellular pathway by packing 
tightly against TM1b and TM7 on one side and TM3, TM8 and EL2 on 
the other side, making extensive contacts that include hydrophobic 
interactions and a hydrogen bond between Asp401 (TM10) and 
Ala 319 (EL4). 

The collapse of the extracellular vestibule in the inward-open state is 
central to our understanding of how therapeutic agents, including TCAs 
and SSRIs, inhibit NSSs. Large, bulky molecules such as tryptophan”, 
the cocaine analogue 2{-carbomethoxy-3B-(4-iodophenyl)tropane”’ 
and SKF-8997A* arrest LeuT, SERT and GAT-1, respectively, in the 
outward-facing conformation by blocking collapse of the extracellular 
vestibule, which in turn precludes opening of the intracellular gate. 
The inward-open LeuT structure now provides a new template for 
designing novel therapeutic agents that arrest NSS transporters in the 
inward-facing conformation. Ibogaine, shown to bind in the intra- 
cellular vestibule of SERT”, is the only known inhibitor of this type. 

The closure of the extracellular gate radically changes the nature of 
the proposed S2 substrate-binding site, which is hypothesized to be 
occupied by substrate in occluded and inward-facing conformational 
states***, Residues Ile 111 and Leu 400, stated to line the S2 site, are 
now deeply buried (Supplementary Fig. 12). Although we do see a 


Scaffold 
domain fig 


Figure 4 | Changes in gate, substrate and ion site interactions and coupling 
to helix movements. a, Comparison of the extracellular gating interactions in 
the inward-open and outward-occluded structures. View is from extracellular 
side. Polar contacts in the inward-open structure are shown as black dashed 
lines. b, Overall view of inward-open structure showing closed extracellular 
gate (box with solid lines) and open intracellular gate (box with dashed lines). 
c, Comparison of the intracellular gating interactions in the inward-open and 
outward-occluded structures. Interactions forming the intracellular gate in the 
outward-occluded state are shown as grey dashed lines. d, Changes in the 
central substrate-binding site. In comparing the outward-occluded and inward- 
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small non-protein electron density feature near EL4, it is probably not 
a substrate molecule, not only because no substrates were included in 
the protein preparation, but also because the density cannot be well fit 
by leucine or a related amino acid. We suggest that the density is due 
to buffer, detergent or several water molecules. 


Rupture and formation of gating interactions 

The extracellular gate of the inward-open state is closed through 
interactions between residues of TM1b (Arg30) and TM10 
(Asp 404, Gly 408, Thr 409), EL4 (Ala319) and TM10 (Asp 401) 
(Fig. 4a) and TM6a (Asp 240) and TM11 (Tyr 471). The guanidinium 
group of Arg 30, riding ‘on top’ of the aromatic ring of Phe 253 
(TM6a), makes multiple interactions with TM10 that include a direct 
salt bridge with Asp 404 and hydrogen bonds to Thr 409 and the 
carbonyl oxygen of Gly 408. In GAT-1, conservative mutation to 
Lys of the residue equivalent to Arg 30 severely compromises uptake 
activity**, an outcome in accord with the manner in which inter- 
actions of Arg30 are modulated in the outward-open, outward- 
occluded and inward-open states of LeuT (Fig. 4a, b). 

In contrast to the extracellular gate, a network of interactions 
within the thick intracellular gate of outward-facing and outward- 
occluded states is disrupted (Fig. 4b, c). Ionic interactions linking 
the N terminus and TMla (Arg5, Trp8) to TM6b (Ser 267, 
Tyr 268) and TM8 (Gln 361, Asp 369) on the intracellular side no 
longer exist. The splaying apart of TM5, TM7 and TM1a also abro- 
gates interactions between Lys 196 and Thr 10 and between Ser 278 
and Arg1l. Consistent with the disruption of these interactions, 
mutation of residues homologous to Arg5 or Trp 8 (ref. 45), Tyr 268 
(refs 27, 28), Asp 369, Lys 196 and Ser 278 (refs 46, 47) in eukaryotic 
transporters alters substrate uptake and shifts the conformational equi- 
librium of the transporter towards an inward-open state. 


open structures, A22, T254 and F253 move away from the binding site. 
Distances were measured relative to leucine from the outward-occluded 
structure. e, A cartoon representing changes in the core domain relative to the 
scaffold domain and the location of hinges relative to the position of substrate- 
and ion-binding sites is shown. f, Changes in the Nal site. Distances of the 
coordinating residues from the sodium ion of the outward-occluded structure 
are shown. g, Superposition of the inward-open and outward-occluded 
structures using TM7. TM1a of only the inward-open structure is shown for 
clarity. Colour scheme is as in Fig. 1. Location of TM1 and TM6 hinges are 
shown as black spheres in c-f. 
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Perturbation of ion and substrate sites 


The location of hinges for the conformational changes in TM1 and 
TM6 of the inward-open state has profound consequences for the 
substrate and sodium ion sites. The hinge for TMla is located at 
Leu 25, considerably ‘above’ the Na2 site, leading to separation of 
residues on TM1a and TM8 that define this site, thus demonstrating 
how the release of sodium from the Na2 site, the movement of TMla and 
the opening of the transporter to the cytoplasm are coupled (Fig. 4c), a 
mechanism that has also been suggested for the Vibrio parahaemolyticus 
sodium/galactose transporter (VSGLT)*. There are two hinges in TM6, 
Ser 256 for TM6a and Phe 259 for TM6b, while TM1b also pivots at 
Leu 25. Because these residues are all close to the substrate-binding site 
but further away from the Nal site (Fig. 4e), there is minimal per- 
turbation of the former (Fig. 4d) but considerable weakening of the 
latter (Fig. 4e, f). Thus, although changes in TM1 and TM6 affect 
residues coordinating the «-amino group of leucine and cause small 
shifts in the side chains of Phe 253 and Phe 259, residues from TM3 
and TM8 that engage the aliphatic moiety of the substrate maintain 
their positions. Consequently, an important portion of the substrate- 
binding pocket is retained in the inward-open state, preserving the 
ability of the transporter to bind substrate under conditions of 
reversed substrate flux. 


Structural principles of transporter mechanism 


The pseudo two-fold symmetric relationship of TMs 1-5 and 6-10 
together with the organization of TMs 1, 2, 6 and 7 as a four-helix 
bundle led to the rocking bundle mechanism of transport, in which 
the core moves as a unit about a rotation axis oriented approximately 
parallel to the membrane and intersecting the substrate-binding site”. 
Detailed analysis of the inward-open state, however, indicates that 
only a portion of the core moves as a unit and that there is not strict 
adherence to the pseudo two-fold symmetry. The conformational 
transition from the outward-occluded to the inward-open state is 
brought about by multiple adjustments in individual TM helices, 
including the bending of TM2 and TM7 at conserved glycine residues 
and the independent movements of TMs 1a and 6b. Indeed, the core 
domains in the inward-open and outward-occluded states align with a 
rm.s.d.of3.5A (Ca atoms), whereas the scaffold domains align with a 
r.m.s.d. of 0.7 A. If TM 1a is excluded from consideration, however, the 
core domain helices superpose with a r.m.s.d of 0.83 A (Ca atoms), 
thus demonstrating that a portion of the core helices do reorient as a 
unit, undergoing a rotation of ~14° about an axis passing through 
Leu 25 and Phe 259 in the plane of the membrane (Fig. 3a, c). The 
intracellular loop 1 (IL1) and EL4 are also related by the pseudo two- 
fold symmetry axis, yet whereas EL4 undergoes large relative move- 
ments, the position of IL1 is constant. 

The structural rearrangement of TM1a in the inward-open con- 
formation of LeuT is different from that seen in Mhp1 (refs 15, 18), in 
which TM1 moves as a rigid body (Supplementary Fig. 13)”. At 
present it is unclear whether the extent to which TM1a is seen to tilt 
away from its position in the occluded state reflects its true position in 
a native membrane environment. The fact that TMla is neither 
involved in crystal contacts nor interacts with the Fab (Supplemen- 
tary Fig. 11) diminishes the possibility that its position is a crystalliza- 
tion artefact. The weak density of TMla suggests that it is highly 
flexible in the detergent/lipid micelles in which LeuTS(TSY) was crys- 
tallized. Indeed, the apparent mobility and large extent of movement of 
TMlaare in general agreement with findings of single-molecule FRET 
studies and molecular dynamics simulations”. 


Coupling of binding sites and helix movements 

The inward-open structure of LeuT suggests the manner by which 
changes in the sodium- and substrate-binding pockets are coupled to 
the larger structural changes that simultaneously open the intracellular 
side and close the extracellular side. The location of the TM1 hinge at 
Leu25 in the inward-open state, at a position extracellular to the 
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Figure 5 | Schematic of transport in LeuT. a-c, Shown are structural 
elements and gating residues instrumental to conformational changes 
associated with the transition from the outward-open (a) to the outward- 
occluded state (b) and the inward-open state (c). At present there is no crystal 
structure for an inward-occluded state and thus no schematic is provided. 


substrate and sodium sites, is consistent with the observation that in 
the absence of Na* ions TM1a moves away from the scaffold domain 
and TM6b, not only initiating conformational changes that open an 
intracellular pathway, but also resulting in the disruption of the sodium 
ion sites. We propose that it is energetically unfavourable for TM1a to 
remain in an intracellular gate ‘closed’ conformation without the com- 
pensating contributions from bound Na*. We further suggest that the 
movement of TM1a initiates a cascade of structural rearrangements 
that result in closure of the extracellular gate as follows. A change in 
the position of TM1a requires movement of the intracellular region of 
TM5 (Fig. 4g). Yet because TMS lacks the hinge-like regions of TM1 
and TM6, the entire helix tilts, ‘pushing’ TM7 and TM1b and closing 
the extracellular gate (Fig. 3c). Similar structural adjustments involving 
TM2 and TMéa also occur, ensuring that the intracellular gate cannot 
open without simultaneous closure of the extracellular gate. 


Mechanism 

Crystal structures of LeuT captured in three distinct conformational 
states show how both local conformational changes and rigid body 
movements of groups of helices are associated with the transport 
mechanism (Fig. 5). Opening and closure of the extracellular and 
intracellular gates exploit hinge-like bending of helices at pivot points 
within short non-helical regions halfway across the bilayer, together 
with the rigid body rotation of helical bundles, movements of extra- 
cellular loops and flexing of numerous transmembrane helices 
(Supplementary Movie). General principles emerging from these 
studies are that local hinge-like movements of transmembrane helices 
are coupled to the formation and disruption of substrate- and 
sodium-binding sites, which are translated through nearly rigid body 
movements of groups of helices and loops into opening and closing of 
the ‘thick’ extracellular and intracellular gates. Most importantly, the 
relative locations of substrate/ion-binding sites and hinge pivots 
define the extent to which hinge motions perturb the sites. More 
broadly, these structures demonstrate how the overall conformation 
and shape of LeuT changes during the transport cycle, thereby pro- 
viding a general mechanistic framework to understand how sub- 
strates, ions, mutations and drugs modulate the conformational 
equilibria and transport activity of LeuT and related NSS proteins. 


METHODS SUMMARY 


LeuT mutants were expressed as previously described'®, monoclonal antibodies and 
Fab fragments were generated by standard methods, and X-ray crystal structures of 
the Fab complexes were solved by molecular replacement. Final models were 
obtained by an iterative process of manual model building and refinement against 
X-ray diffraction data. The functional properties of LeuT mutants were examined 
using scintillation proximity binding assays and uptake or exchange assays with 
LeuT reconstituted into proteoliposomes. 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Purification of LeuT mutants. The site-directed mutants LeuT(Y108F), 
LeuT*(TS) and LeuTS(TSY) were produced by polymerase chain reaction 
(PCR) and subcloned into a pET16b plasmid containing a thrombin-cleavage 
site and a C-terminal octa-histidine tag. All mutants were verified by DNA 
sequencing. The resulting proteins were expressed and purified as previously 
described’*”* with the following modifications. To obtain leucine-free trans- 
porter, buffers for purification did not contain sodium salts and membranes were 
washed three times in sodium-free buffer to ensure that no endogenously bound 
leucine was carried along. LeuT mutants were solubilized with n-dodecyl-B-p- 
maltopyranoside (CM) and purified by Ni-affinity chromatography. Protein for 
functional and biochemical assays was further purified by gel filtration in buffer 
containing C).M. 

Monoclonal antibodies and Fab fragments. Mouse monoclonal antibodies 
against LeuT*(Y108F) and LeuT*(TSY) were raised by standard methods using 
corresponding purified protein in detergent as antigen. Antibodies recognizing a 
conformational epitope in LeuT*(Y108F) were selected using FSEC” and western 
blot analysis, resulting in the identification of the 2B12 monoclonal antibody. 
Antibodies specific to the inward-facing mutant LeuT‘(TSY) were selected based 
on their relative affinities for LeuT<(TSY) and wild-type LeuT as determined by 
FSEC. From these screens we isolated the 6A10 monoclonal antibody. All FSEC 
analysis relied on C-terminal GFP fusions of the LeuT variants. Antibodies were 
purified from hybridoma supernatants using a SP sepharose cation-exchange 
column. Fab fragments were generated by papain digestion at 37°C in 50 mM 
sodium phosphate pH 7.2, 1mM EDTA and a papain to monoclonal antibody 
ratio of 1:20 w/w (2B12) or 1:100 w/w (6A10). Cysteine at 10 mM final concen- 
tration was added to the digestion reaction for 6A10. Digestion of 2B12 was 
stopped with 30mM iodoacetamide after 4h, while digestion of 6A10 was 
quenched by transferring the reaction to 4°C after 2h. Fab fragments were 
purified on a Protein A column, followed by ion-exchange chromatography. 
DNA encoding the light and heavy chains of 2B12 (IgG2a, «) and 6A10 (IgG1, 
k) Fab fragments were cloned and sequenced from hybridoma cells using rapid 
amplification of 5' complementary DNA ends (5’-RACE). 

Purification of Fab complexes for crystallization. The His-tag of LeuT destined 
for crystallization was cleaved by thrombin. Transporter was then mixed with 
excess Fab and the complex was subjected to gel filtration in n-octyl-B-p- 
thioglucopyranoside (CgSG)-containing buffer. The buffer consisted of 20 mM 
Tris, pH 8.0, 50mM (LeuTS(TSY)-Fab) or 100 mM (LeuT*(Y108F)-Fab) KCl 
and 12mM (LeuTS(TSY)-Fab) or 15mM (LeuTS(Y108E)-Fab) CSG. For 
crystallization of LeuT“(TSY)-Fab complex, the protein was then supplemented 
with a fivefold molar excess of 1,2-dimyristoyl-sn-glycero-3-phosphoethanolamine 
(DMPE) and incubated at 4 °C for 1h before removing insoluble lipids by cent- 
rifugation. A 55 mM DMPE stock was prepared in 20% DMSO and 80% gel filtra- 
tion buffer. Protein was concentrated to 3.0-3.2 mg ml ~ 1 (LeuT(Y108E)-Fab) or 
4.5-5 mgml ' (LeuT*(TSY)-Fab) for crystallization. 

Crystallization. Crystallization was carried out by vapour diffusion in hanging 
drops at 20°C. LeuTS(Y108F)-Fab crystals were grown in 40 mM Tris pH 7.5, 
23-26% PEG 550 monomethy] ether and 50-100 mM NaCl, and cryo-protected 
in 35% PEG 550 monomethyl ether. Crystals were allowed to sit in the final cryo- 
solution for 4-5 h before flash freezing in liquid nitrogen. 

Different crystal forms were obtained for the LeuT“(TSY)-Fab complex in 
similar crystallization conditions with the best crystals belonging to space group 
C222). These crystals were grown in 100 mM HEPES pH 7.6, 0.1 M Mg(NO3)2, 
12-14% PEG 1500 and 1.5% w/v trimethylamine N-oxide dehydrate in a protein 
to precipitant ratio of 1:2. Leul(TSY)-Fab crystals were cryo-protected with 
15% PEG 1500 and 20% glycerol before flash-freezing. 

Crystals of alanine-bound LeuT*(TS) were obtained from protein purified in 
20mM Tris pH 8.0, 100mM L-Ala, 200mM NaCl and 40mM n-octyl-B-p- 
glucoside (§-OG). Purification and crystallization were carried out as described 
for wild-type LeuT””. 

Data collection and structure determination. For the LeuT(Y108F)-Fab com- 
plex, X-ray diffraction data sets for the crystals were indexed, integrated and 
scaled using HKL2000 and the CCP4 suite of computer programs’’”’. The crystals 
diffract to ~3.1 A resolution, belong to the space group P22;2, and harbour two 
LeuT(Y108F)-Fab complexes per asymmetric unit, giving a Matthews coef- 
ficient® of 3.9 A Da” *. The structure was determined by molecular replacement 
using the program PHASER™. The search probes were the tryptophan-bound 
LeuT structure (PDB accession 3F3A) and a Fab homology model built using 
SWISS-MODEL”. The model was built using COOT” into 2F, — F. maps, cross- 
validated using simulated annealing composite omit maps and refined with 
PHENIX” using non crystallographic symmetry, individual atomic displacement 
parameters and translation/libration/screw (TLS)°* subdomains. Six TLS 
groups composed of two LeuT molecules, two Fab constant domains and two 
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Fab variable domains were defined. This iterative model building and refinement 
procedure yielded a structure with good crystallographic and stereochemical 
statistics. Regions of weak or no electron density were excluded from the model 
such that the final structure consists of LeuT residues 5-129 and 135-507, Fab 
light chain residues 1-215, and Fab heavy chain residues 1-135, 143-219. 

For the LeuT*(TSY)-Fab complex, the crystals diffract to ~3.2 A resolution 
and belong to the space group C222). There is one complex in the asymmetric 
unit, resulting in a Matthews coefficient of 3.4 A Da’ °. Initial phases were derived 
from a molecular replacement solution using PHASER™ in which a partial struc- 
ture of leucine-bound LeuT (PDB accession 2A65) and a high-resolution IgG1 
Fab structure (PDB accession 1Q9Q) were used as search probes. The partial 
LeuT search probe was obtained by deleting TMs 1, 6 and 12 from the 2A65 
structure. Electron density maps calculated using the ‘prime and switch’ method” 
clearly showed the LeuT regions where there were deviations from the 2A65 
structure. This initial molecular replacement solution was used as a starting point 
for automated building using Phenix. Further changes were made to the LeuT 
portion using COOT™. At this stage, electron density for the Fab portion was 
poor, making it difficult for manual building. A homology model built using PDB 
accession 1EJO for the light chain and PDB accession 2Z4Q for the heavy chain by 
PHYRE® resulted in a substantial decrease in the crystallographic R factors, 
yielding Rwor and Rgee Values of 0.33 and 0.36, respectively. This model and a 
2F, — F. map calculated with exclusion of the test set of reflections (Rgree set) were 
then submitted to a crystallographic refinement server (http://godzilla.uchicago. 
edu). The resulting model had improved stereochemistry and Ryo and Réree 
values of 0.29 and 0.32, respectively. 

Further improvements to the model were made until the Rwork/Rfee Values 
converged to 0.26/0.30 and structure quality assessed using Molprobity*' was 
satisfactory. Throughout the model building procedure, bias-minimized ‘prime 
and switch’ maps” and simulated annealing composite omit maps were used. 
Individual isotropic B factors and TLS parameters were refined using 11 TLS 
groups identified using Phenix”’: three LeuT domains comprised of residues 11- 
184, 185-254 and 255-511, four Fab heavy-chain domains involving residues 
1-83, 84-140, 141-180 and 181-219, together with residues 1-94, 95-131, 132- 
177 and 178-215 forming the 4 Fab light-chain domains. The final model consists 
of LeuT residues 11-511, Fab light-chain residues 1-216, Fab heavy-chain resi- 
dues 1-102, 104-219, and several detergent molecules. 

The changes in helix and domain orientation were analysed using the program 
Interhlx (K. Yap, University of Toronto) and DynDom, respectively, and molecular 
figures were prepared using PyMOL. 

Saturation binding assays. Dissociation constants of LeuT mutants for leucine 
were determined by scintillation proximity assays as described previously” at a 
protein concentration of 1 4M in 20mM Tris-MES pH 7.0, 200 mM NaCl and 
1mM C,,M. Non-specific binding was measured in duplicate using 1mM 
L-leucine as cold competitor and total binding was measured in triplicate. Data 
were fit with GraphPad Prism. 

Transport time course. LeuT mutants were reconstituted into liposomes as 
previously described”*, at a protein to lipid ratio of 1:100, and loaded with either 
Buffer 1 (20mM Tris-MES pH 6.0 and 500mM KCl) for uptake or Buffer 2 
(20 mM Tris-MES pH 6.0, 500mM NaCl and 10 mM t-leucine) for exchange. 
Transport was initiated at 27 °C by diluting proteoliposomes to 10 pg ml’ into 
external buffer (20mM Tris-MES pH 6.0, 500mM NaCl) containing either 
200nM (for LeuT(Y108F)) or 400nM (for LeuTS(TS) and LeuT*(TSY)) or 
10 nM [°H]-leucine. Uptake or exchange was followed by quenching 100 jl sam- 
ples of the reaction mixture in 1.8 ml of ice-cold Buffer 1 for uptake or Buffer 2 for 
exchange. Background counts were measured with control reactions performed 
in the absence of sodium. Reactions were measured in replicates of 3-5 and 
analysed using GraphPad Prism. 

FSEC. FSEC experiments” were carried out using a Superose 6 10/300 column 
with the mobile phase consisting of 20 mM Tris, pH 8.0, 1 mM C,,M, and either 
200-400 mM NaCl and 10 mM leucine or 200 mM KCI] for analyses of LeuT in 
substrate-bound and apo states, respectively. Fab-LeuT complexes were prepared 
by mixing LeuT and Fab in a molar ratio of 1:2. Elution was followed using 
tryptophan fluorescence with excitation and emission wavelengths at 280 nm 
and 335 nm, respectively. 
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Most Sun-like stars in the Galaxy reside in gravitationally bound 
pairs of stars’ (binaries). Although long anticipated’ *, the exist- 
ence of a ‘circumbinary planet’ orbiting such a pair of normal stars 
was not definitively established until the discovery’ of the planet 
transiting (that is, passing in front of) Kepler-16. Questions 
remained, however, about the prevalence of circumbinary planets 
and their range of orbital and physical properties. Here we report 
two additional transiting circumbinary planets: Kepler-34 (AB)b 
and Kepler-35 (AB)b, referred to here as Kepler-34 b and Kepler- 
35 b, respectively. Each is a low-density gas-giant planet on an orbit 
closely aligned with that of its parent stars. Kepler-34 b orbits two 
Sun-like stars every 289 days, whereas Kepler-35 b orbits a pair of 
smaller stars (89% and 81% of the Sun’s mass) every 131 days. The 
planets experience large multi-periodic variations in incident stellar 
radiation arising from the orbital motion of the stars. The observed 
rate of circumbinary planets in our sample implies that more than 
~1% of close binary stars have giant planets in nearly coplanar 
orbits, yielding a Galactic population of at least several million. 

The new planets were identified using 671 days of data from the 
NASA Kepler spacecraft’®. As part of its mission"! to detect Earth-like 
planets via the transit method, Kepler is monitoring more than 2,000 
eclipsing binary stars'*’’. From these, we selected a sample of 750 
systems with orbital periods ranging from 0.9 to 276 days, and for 
which eclipses of both stars occur. For each system, we measured 
the eclipse times for both stars and searched for departures from strict 
periodicity, as would be produced by gravitational perturbations from 
a third body. 

All 750 systems were searched by eye for planetary transits, with 
particular attention being paid to a subset (18% of the total) that 
exhibited significant differences between the periods derived from 
the deeper primary eclipses, and those derived from the shallower 
secondary eclipses (for details, see Supplementary Information). 
(Here ‘primary eclipse’ indicates the eclipse of the primary star by 
the secondary star.) This led to the discovery of Kepler-34 and 


Kepler-35, and a candidate system KOI-2939. KOI-2939 (Kepler 
Input Catalog'* number 05473556) exhibited a single transit at BJD 
(barycentric Julian date) 2,454,996.995 + 0.010 of duration 2.5h and 
depth 0.18%. The transit duration constrains the size and velocity of 
the third body and is consistent with a Jovian planet transiting the 
secondary star, but we cannot verify its planetary nature. We defer 
discussion for a future investigation. 

The stars of Kepler-34 (denoted A and B, where A is the brighter, 
more massive primary star) have an orbital period of 28 days, with 
a period difference between primary and secondary eclipses of 
4.91 + 0.59s. Three transits were detected (Fig. 1), with the first and 
second being transits of the primary star, while the third is of the 
secondary star. Notably the transit durations are all different, ruling 
out the most common type of ‘false positive’, a background eclipsing 
binary. Circumbinary transits naturally vary in duration as a con- 
sequence of the changing velocity of the stars. The Kepler photometry 
was supplemented by spectroscopic observations of the radial-velocity 
variations of both stars (Fig. 1f), in order to determine the orbital scale 
and sizes of all three bodies. The photometric and spectroscopic data 
were fitted with a model”'* that accounts for the three-body gravita- 
tional dynamics and the loss of light due to eclipses and transits (see 
Supplementary Information). The model fit confirms that the trans- 
iting body is a planet with 22% of the mass of Jupiter (69 Earth masses) 
and 76% of the radius of Jupiter (8.6 Earth radii). The primary and 
secondary stars are similar to the Sun. Using the spectra, we also 
measured the effective temperature and abundance of heavy elements 
(metallicity) of both stars. The observed stellar parameters match the 
Yonsei-Yale theoretical models of stellar evolution’® for an age of 
5-6 Gyr. The parameters and uncertainties are given in Table 1, with 
details in Supplementary Information. 

The stars of Kepler-35 (A and B) have an orbital period of 21 days, 
with a period difference between primary and secondary eclipses of 
1.89 + 0.48 s. Four transits were detected (Fig. 2 b-e). The first, second 
and fourth events are transits of the primary star, and the weaker third 
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Figure 1 | Observations of Kepler-34. a, A portion of the normalized light 
curve, showing the relative brightness versus time (in units of barycentric Julian 
days, BJD). Low-frequency variations and instrumental drifts have been 
removed (Supplementary Information). The blue points show the primary 
eclipses (star B eclipses star A), orange points show the secondary eclipses, red 
points show the primary transits (planet transits star A), and green shows the 
secondary transit. The times of each event are indicated by the arrows. Owing to 
gaps in the observations, one primary and one secondary eclipse were missed. 
b-d, Close-up views of the three transit events. The solid curve is the 
photometric-dynamical model. Variations in transit widths are mainly due to 
differences in the transverse velocity of the stars during transit. The large drop 
before the transit in c is due to a primary eclipse. e, Close-up views of the phase- 
folded primary and secondary eclipses plotted versus orbital phase (time 
modulo the orbital period P, where P = 27.795794795 d and the time of 
periastron is BJD 2,455,007.5190). Only Kepler Quarter 4 data are shown. 

f, Radial velocities of the primary star (blue points), secondary (orange points), 
and the model curve, versus orbital phase. g, Observed (O) minus computed 
(C) diagram, showing the deviations between the measured eclipse times and 
those predicted assuming strict periodicity. Primary eclipses are shown as blue 
points, secondaries by orange points, and the corresponding models by the red 
curves. A period of 27.79578193 days and an epoch of BJD 2,454,979.72301 
were used to compute the primary eclipse times, and a phase offset of 0.6206712 
for the secondary eclipse times. The divergence indicates the primary and 
secondary periods are different. The two vertical bars in the lower left denote 
the median +1o uncertainties of the primary and secondary eclipse times: 0.10 
and 0.22 min. 


event is a transit of the secondary star. Transits do not occur every 
planetary orbit, placing a strong constraint on the mutual orbital 
inclination and its evolution. The transits differ in duration, and the 
interval between transits is not constant, again signalling a circumbinary 
body. Modelling the photometry and radial velocities yields the system 


1 1 14 
0 0.5 1 0 200 400 600 
Orbital phase Time (BJD — 2,455,000) 


Table 1 | Circumbinary planet system parameters 


parameters given in Table 1. The transiting body is a planet with 13% 
of the mass and 73% of the radius of Jupiter (respectively 41 Earth 
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Planetary properties 
Mass of planet, My (Mjupiter) 


Radius of planet, Rp (Ruupiter) 
Mean density of planet, pp (g cm?) 
Planet surface gravity, g, (cm s-*) 
Properties of the planetary orbit 
Reference epoch (BJD) 

Period, P (days) 

Semi-major axis length, a (AU) 
Eccentricity, e 

e sin(w) 

€ COS(~) 

Mean longitude, 4 = M + w (deg) 
Inclination, i (deg) 

Relative nodal longitude, 2 (deg) 


Properties of the stars in the stellar binary 
Mass of A, Ma (Msun) 


Radius of A, Ra (Rsun) 

Mass of B, Mg (Msun) 

Radius of B, Re (Rsun) 

Flux ratio in the Kepler bandpass, Fe/Fa 
Properties of the stellar binary orbit 
Period, P (days) 

Semi-major axis length, a (AU) 
Eccentricity, e 

e sin(@) 

e COS(m) 

Mean longitude, 4 = M + w (deg) 
Inclination, i (deg) 
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Results of the photometric-dynamical model for Kepler-34 (KIC 8572936) and Kepler-35 (KIC 9837578). The orbital parameters listed are the osculating Jacobian parameters, thatis, the instantaneous Keplerian 
elements for the listed epoch. In general, unlike the simple two-body Keplerian case, the orbital elements are functions of time. In particular, the orbital period of Kepler-34 b varies from 280 to 312 days on secular 
timescales; the median period is ~291 days. See Supplementary Information for details. For direct comparison, values? for Kepler-16 are listed. The mean longitude (A) is the sum of the mean anomaly (M) and the 


argument of periastron («). 
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masses and 8.2 Earth radii). Comparison to stellar-evolutionary 
models suggests a system age of ~8-12 Gyr, although, interestingly, 
the models do not provide a satisfactory match to the stellar masses and 
radii under the assumption of a common age and metallicity (Sup- 
plementary Information). 

The mean densities of Kepler-34 b and Kepler-35 b are respec- 
tively 0.61 and 0.41 gcm™, somewhat lower than the 0.96 gcm™ of 
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Figure 2 | Observations of Kepler-35. The representation of data is similar to 
that in Fig. 1. a, A portion of the light curve for Kepler-35. Owing to 
interruptions in the data acquisition, two primary and two secondary eclipses 
were not observed. b-e, Close-up views of the four transit events. Red points 
denote primary transits, green points denote a secondary transit. Note the 
differences in transit duration. f, Close-up views of the primary eclipses and 
secondary eclipses, plotted versus orbital phase where P = 20.733762175 days 
and the time of periastron passage is BJD 2,455,007.3131. Only Kepler Quarter 
4 data are shown (BJD 2,455,183 through to 2,455,275). g, Radial velocities of 
the primary star (blue points), secondary (orange points) and model fit. 

h, Observed minus computed diagram, where a period of 20.73373997 days and 
an epoch of BJD 2,454,965.84579 were used to predict the primary eclipses, and 
a phase offset of 0.5055680 for the secondary eclipses. The two vertical bars in 
the upper left denote the median +1o uncertainties of the primary and 
secondary eclipse times, 0.27 and 0.26 min, respectively 


Kepler-16 b, but all are consistent with low-density gas-giant planets. 
Figure 3 gives a visual comparison of the systems’ orbits. For all three 
systems, the planetary and stellar orbits are aligned to within 2°, sug- 
gesting that each system formed from a flat disk of material. The period 
ratios (planetary to stellar) for Kepler-34, Kepler-35 and Kepler-16 are 
10.4, 6.3 and 5.6, respectively, only 21%, 24% and 14% larger than 
analytic estimates for stability against three-body interactions'”””’. 
Long-term integration of the equations of motion confirms that these 
two new systems are stable for at least 10 Myr (Supplementary 
Information). Note that the planets’ locations bracket the habitable 
zone’ (where liquid water would be stable on the surface of a rocky 
planet), with Kepler-34 b and Kepler-35 b lying interior to the 
habitable zone and Kepler-16 b lying exterior to it. 

A simple argument suggests that circumbinary giant planets are not 
extremely rare, as three such objects have been seen in our sample of 
750 systems. Given the orbital geometry of Kepler-34, Kepler-35 and 
Kepler-16, the probability”’ that a randomly placed observer who sees 
stellar eclipses would also see planetary transits is approximately 12%, 
14% and 21%, respectively (Supplementary Information). If this 
probability of roughly ~15% were constant across all 750 target 
systems, then the fraction of binaries with circumbinary gas giant 
planets at similar periods would be (3/750) x (0.15) 1, or a few per 
cent. However, this does not account for the period distribution of 
binaries in our sample, and the search is not complete; consequently a 
lower limit of ~1% is reasonable. With ~2.6% of all Sun-like stars in 
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Figure 3 | Orbital configurations. a, Scale view of the orbits of the Kepler-34 
system seen face-on (top) and also as seen from Earth (boxed, bottom). In the 
face-on view, the stars and planet are too small to be seen relative to their orbit 
curves, and so are represented as dots and marked with symbols A, B, and b 

denoting the primary star, secondary star, and planet. This view is correct for a 
given epoch (BJD 2,455,507.50). Because of the dynamical interactions between 
the three bodies, this orbital configuration will evolve. For example, the orbits 
precess, and hence the orbits do not actually close. The line-of-sight view shown 


b A 


in the box depicts the stars and planet with correct relative sizes and orientation. 
More importantly, the orbits and the orbital tilts are accurately portrayed, 
showing how transits do not necessarily occur at every conjunction. The grey 
shaded area in the top panel denotes the limits of the view shown in the 
expanded bottom panel. b, As for a, but for Kepler-35 at epoch BJD 
2,455,330.60. Note that the relative sizes of the bodies are drawn to scale for 
each panel (a-c), not just within a panel. c, As for a, but for Kepler-16 at epoch 
BJD 2,455,213.0. 
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Insolation (Sun-Earth equivalent) 


Figure 4 | Variations in insolation received by Kepler-34 b and Kepler-35 b. 
a, Left panel, the black curve shows the incident flux (insolation) received by 
Kepler-34 b from its two stars. The insolation is in units of the solar constant S 
(solar flux received at a distance of 1 Au; S = 1.0 for the Sun—Earth system). The 
contribution from star A is shown in red and the contribution from star B in 


the Galaxy residing in binary star systems similar to the three discussed 
here*”’ (Supplementary Information), a conservative estimate yields 
millions of nearly coplanar circumbinary planets in the Galaxy like the 
ones reported here. 

Orbital motion of the central stars causes complex time variations in 
stellar insolation for circumbinary planets. Figure 4 shows the calcu- 
lated insolation for Kepler-34 b and Kepler-35 b. The variation is 
multi-periodic, with changes on the timescales of the stellar orbit, 
the planetary orbit, and the long-term precession of the orbits due to 
three-body effects. For Kepler-34 b and Kepler-35 b, the average 
insolation is (respectively) 2.4 and 3.6 times the Earth’s insolation, 
with maximum-to-minimum ratios of 250% and 160%. By com- 
parison, for Venus the insolation is 1.9 times the Earth’s with only a 
2.7% variation. These highly variable, multi-periodic fluctuations in 
insolation are unique to circumbinary planets, and can lead to complex 
climate cycles. It will be interesting to explore the effects of these 
swings in insolation on the atmospheric dynamics (Supplementary 
Information), and ultimately on the evolution of life on habitable 
circumbinary planets. 
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Generation of scaled protogalactic seed magnetic 
fields in laser-produced shock waves 
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The standard model for the origin of galactic magnetic fields is 
through the amplification of seed fields via dynamo or turbulent 
processes to the level consistent with present observations’ ”. 
Although other mechanisms may also operate**, currents from 
misaligned pressure and temperature gradients (the Biermann 
battery process) inevitably accompany the formation of galaxies 
in the absence of a primordial field. Driven by geometrical asym- 
metries in shocks® associated with the collapse of protogalactic 
structures, the Biermann battery is believed to generate tiny seed 
fields to a level of about 107! gauss (refs 7, 8). With the advent of 
high-power laser systems in the past two decades, a new area of 
research has opened in which, using simple scaling relations”, 
astrophysical environments can effectively be reproduced in the 
laboratory"”’. Here we report the results of an experiment that 
produced seed magnetic fields by the Biermann battery effect. We 
show that these results can be scaled to the intergalactic medium, 
where turbulence, acting on timescales of around 700 million 
years, can amplify the seed fields'** sufficiently to affect galaxy 
evolution. 

We conducted these experiments at the Laboratoire pour 
Utilisation de Lasers Intenses (LULI 2000) laser facility. The intense 
laser was used to illuminate a small carbon rod inside a low-pressure 
gas-filled interaction chamber. A full description of the experimental 
set-up, including details of the laser properties, is provided in Fig. 1. 
Because of the large energy density accumulated on the carbon target 
over the relatively short laser pulse duration, the heated solid matter 
initially underwent a ballistic expansion until the shocked mass was 
roughly equal to the ejected mass, and then the shock transitioned to a 
Sedov-Taylor blast wave'*'®. The shock evolution over time was 
monitored using optical diagnostics. The experimentally measured 
shock position for t<200 ns was then compared with a one- 
dimensional radial radiation hydrodynamic simulation using the 
HELIOS-CR software package” (Fig. 2). Given the quantitative agree- 
ment of the experimental measurements of the density values at 
t<200 ns with the results of the HELIOS-CR code, we used the 
simulations to extrapolate the experimental results to later times, 
t ~ 0.5-1.5 us, when the shock arrived at the position of the magnetic- 
field probes but the plasma density was below the detection limit of the 
optical probe. 

The measurement of the magnetic field was performed with three- 
axis magnetic induction coils'*, giving both the magnetic-field com- 
ponents along the shock: normal (Bj)) and perpendicular to the shock 
(B , ), as indicated in Fig. 3. At a radius r ~ 3 cm from the initial blast, 
peak B, values occurring at t ~ 1-2 1s were in the range 10-30 G. The 
position of the first peak in B , was in agreement with the shock arrival 
time estimates based on HELIOS-CR, and was also consistent with the 
time lags between different locations of the probe coils. Finally, shots 


taken with no ambient gas (not plotted) showed no peak in B, for 
t~ 0.5-2 us, which further supports the inference that the magnetic 
field arrives with the shocked plasma. 

It is well known that return currents driven by hot electrons can 
generate magnetic fields in laser-produced plasmas”’’. However, at 
our laser intensity we expected the hot electrons to be distributed with 
temperature Thot~5keV (ref. 20). If so, they would reach the coil 
position at ¢$3 ns, much earlier than the measured peak in B,. A 
magnetic field generated through vorticity by Biermann’s battery’ is 
more plausible. We estimated it as*’: 


OVshock 
0s 


Mion ae (p om 1) Mion 


; = (1) 


Byort = 


where w = V X vis the vorticity, mion is the ion mass, e is the electron 
charge, p ~ 3 is the shock compression ratio (see Supplementary Fig. 1) 
and OVshock/OS is the tangential gradient of the shock velocity. Since 
OVehock/ OS © KVshock/T, where Kk + 0.1-0.3 (Fig. 3a), at r~3cm we 
obtained B,,,,~ 10-30G, in agreement with the measured values. 
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Figure 1 | Experimental set-up showing the laser beams and diagnostics 
configuration. Either one or two frequency-doubled (527 nm), 1.5-ns-long 
laser beams are focused on the tip of a 500-m-diameter carbon rod. At focus, 
each laser beam has a 400-m flat-top distribution, achieving a peak intensity of 
2x10!'*Wcm ?, delivering around 350 J. The interaction chamber is filled 
with helium gas at pressure P = 0.8 + 0.3 mbar or P = 1.6 + 0.3 mbar (with the 
error estimated from the uncertainty in the pressure readings inside the target 
chamber). The shock-wave evolution during times t < 200 ns (where t = 0 
corresponds to the time of the laser pulse) was monitored using transverse 
interferometry and Schlieren shadowgraphy with an optical probe (with 

532 nm wavelength and 6 ns gate width). The interferometer was of Mach- 
Zehnder type with an approximately 25-mm field of view, and it was used to 
provide the electron density. The inset on the left shows an example of an 
interferometric image. The inset on the right shows a streaked self-emission 
optical pyrometry (SOP) image in a narrow band at 450 + 30 nm. 
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Figure 2 | Comparison between numerical simulations and optical 
diagnostics. a, HELIOS-CR simulations (done with a one-beam laser driver) 
and experimentally determined shock position using both interferometry and 
self-emission optical pyrometry. The laser energy deposition in the HELIOS- 
CR code was adjusted to 70% of the nominal value so that the simulated shock 
position would match the experimental values. The transition to a Sedov- 


The range of measured magnetic-field strengths agreed with the 
observed shot-to-shot variations in the measured shock properties at 
early times, as indicated in the Supplementary Information. We 
observed that the peak of the measured magnetic energy density in 
the downstream plasma corresponds to a fraction of a per cent of that of 
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Taylor blast wave occurs at distances r << 10 mm from the sample. b, Calculated 
electron density at t= 150 ns versus electron density values extracted from 
interferometric measurements via Abel inversion, which requires the 
assumption of cylindrical symmetry. Because observed differences in the shock 
wave radius are less than 20%, this gives the uncertainty in the density 
measurement. 


upstream gas density (Supplementary Fig. 1). On the other hand, the 
magnetic-field energy density was significantly larger than the ram pres- 
sure of the incoming ionized plasma: ¢f” = B? / pin? mionVaggce & 0-1 
(with n2~10'4 cm~? the inferred electron density ahead of the shock). 
This suggests that, in this experiment, the kinetic Weibel instability may 


not play an important part, given that one would expect it to produce a 


Figure 3 | Magnetic-field measurements from 
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induction coils. a, Schlieren image, showing laser 
and magnetic pick-up coil configuration as well as 
the shock position. The Schlieren image indicates 
when the refractive index of the plasma changes 
rapidly, thus tracking the position of the shock. 
From the asymmetry in the shape of the shock wave 
we obtain an approximate value for x of 0.1-0.3 in 
equation (1). The induction coils were placed at 
2.8 cm and 3.6 cm from the carbon rod (that is, the 
centre of the initial blast). Each coil consisted of 
eight twisted pairs of coils wound around the axis of 
a 3.1 mm X 3.1 mm plastic core. The voltage from 
the twisted-pair loops was then differentially 
amplified to remove any electric-field components, 
and the magnetic field was calculated using 
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Faraday’s law. The coils were protected from the 
surrounding plasma by a 1-mm-thick glass tube. 
Time resolution was better than 50 ns. We 

estimated the error associated to each magnetic- 
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field trace to be about 15%. b, B, (lines) and Bj| 
(symbols) traces taken at P = 0.8 mbar. The rise and 
gradual decay of B, was consistent with the shock 
front crossing the coil and the subsequent evolution 
of the shocked material. The second peak at t ~ 5 pis 
seen for the case of two beams illumination is 
probably associated with the ejecta material from 
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the sample arriving at the coil. c, B, (lines) and B)| 
(symbols) traces for the one-beam laser driver at 
P= 1.6 mbar. d, Same as b but at r = 3.6 cm. The 
inset shows the measured shock radius along the 
laser axis for P = 0.8 mbar and single-beam 
illumination. We attribute shot-to-shot variations 
in the observed field values (up to ~50%) to the 
stochastic generation of vorticity and to variations 
in the radius of curvature of the shock front (as large 
as ~20% at early times, as shown in the inset). 
Additional details are given in Supplementary Fig. 2. 


26 JANUARY 2012 | VOL 481 | NATURE | 481 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


a 
cS 
Ke} 
e 2 
cS 2 
- ry 
= 
D 
oO 
= 
b 
20 T T T 
—g-— P=0.8 mbar 
15 S -<3- P=1.6mbar 7 
S 40+ 
a 
5E 
OF I ! | “| 
0 0.5 1 1.5 2 


Time (us) 


Figure 4 | Resistive magnetohydrodynamics simulations of the magnetic- 
field generation. The magnetohydrodynamics scheme is an implementation 
of ref. 25. The ionization state was computed using Saha equilibrium”’. 
Radiation transport, thermal conduction and laser coupling were not included 
in the magnetohydrodynamics code. Although radiation transport is important 
during the early stages of the laser drive (for a few nanoseconds), at the time 
when the shock has expanded and magnetic-field measurements are taken, the 
radiation cooling time exceeds the hydrodynamic timescale (taq/ 

Thydro ~ 8 X 10°), so radiative processes do not affect the hydrodynamic 
evolution. For the initial conditions we used the results from HELIOS-CR. At 
t = 150 ns (the time when the shock has swept about 1 cm from the initial 
carbon rod position), the system had uniform density and temperature, except 
for a central region (r< 1 cm) where the temperature is T(x, y) = To 

(1 + dcos0), with Ty = 50 eV and 6 = 0.9, and 0 € [0, 27] is the polar angle. 
Therefore the scale coefficient x is around 1/1, in agreement with experimental 
estimates. The numerical grid consists of 2,000 X 2,000 cells with resolution 
Ax = Ay = 0.005 cm. a, Two-dimensional plot of the perpendicular 
component of the magnetic field obtained from magnetohydrodynamics 
simulations. Note that because of finite magnetic diffusion, the plasma was 
magnetized ahead of the shock front. b, Plot of the magnetic flux density at 
2.7 cm from the centre of the domain (x = 2.5cm, y = 1 cm). 


much smaller field, that is, 0"~(m./mion)~ 107 (refs 4 and 22), where 
mg is the electron mass. 

The vorticity-generated magnetic field is perpendicular to the shock 
normal, consistent with the experimental traces in Fig. 3 that show 
B,>>B). We note that in a perfectly spherical shock no magnetic field 
can be produced because OV .nocx/0S = 0. Magnetic fields can thus only 
be generated in non-spherical expansion shocks, which are expected to 
occur during structure formation in the pregalactic phase’. Our inter- 
pretation is reinforced by a computer calculation that solved the 
resistive magnetohydrodynamics equations in two dimensions 
(Fig. 4) with a baroclinic (Biermann) source term for the magnetic- 
field generation (that is, — Vn. x Vpe jf en, with n, the electron density 
and p, the electron pressure). The asymmetric expansion due to 
the initial temperature anisotropy of the laser illumination drove 
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Table 1 | Similarity scaling between the laboratory and the intergal- 
actic medium 


Scaling parameters Definition Laboratory (LULI) Intergalactic medium 

value extrapolated value 
Characteristic L=2I/k 18.8cm 1 Mpc 
length scale of the 
system 
Characteristic t lus 0.7 Gyr 
timescale of the 
system 
Electron Fes 2eV 100 eV 
temperature 
Electron density Ne 5 x10!5cm-3 10°-4cem-3 
Cyclotron frequency eB 48 x10*s"+ 8.7 x10°!8s"t 

B= 
Mion 

Reynolds number Re 7.9 x 108 3.0 x 1038 
Peclet number Pe 69.0 7.0 x 1013 
Magnetic Reynolds Rem 16.5 3.9 x 10°’ 
number 


The similarity between the astrophysical and the laboratory systems breaks down owing to viscous 
dissipation at laboratory spatial scales h < Aco), Where Aco! is the collisional mean free path®, and equally 
for thermal and magnetic dissipation. This implies that, for the conditions of our experiment, in which 
magnetic dissipation is the dominant effect, full similarity is achieved at macroscopic scales (h 25 yum), 
and the microphysics at the kinetic level will be different. The similarity given here roughly applies to the 
shocked intergalactic medium for scales of L225 pc. 


baroclinic generation of a magnetic field comparable in strength to 
what we measured in the laboratory. 

We applied magnetohydrodynamics scaling techniques to investigate 
the importance of the Biermann battery for magnetic-field generation at 
high-Mach-number astrophysical shocks occurring in the intergalactic 
medium. Similarity in the magnetohydrodynamics equations is pre- 
served if heat convection divided by conduction (the Peclet number), 
inertial forces divided by viscosity (the Reynolds number), and inertial 
forces divided by magnetic diffusivity (the magnetic Reynolds number) 
are all large in both systems (Table 1). From equation (1), the magnetic 
field generated by the Biermann mechanism is proportional to the shock 
generated vorticity. It follows that, on a given spatial scale L, which 
characterizes the velocity gradients, © ~V.oa/L~ I/t (see also 
Supplementary Information). Our laboratory results thus indicate that 
curved intergalactic shocks in protogalactic structures, with changing 
curvature at the level of a few tens of a per cent on scales of about 
one megaparsec, can generate magnetic fields with values of around 
10 “'G. 

The curvature radius, the asymmetry and the resulting magnetic 
field are all consistent with the scenario of protogalactic magnetic- 
field generation from the Biermann battery process modelled with 
numerical simulations in ref. 7. Thus we have experimentally verified 
such a process. Generation of these tiny magnetic fields precedes the 
formation of the galaxy. The magnetic field can quickly be amplified to 
about equipartition values by turbulent motions and can strongly affect 
the evolution of the galactic medium from its early stages'*"*. Indeed, 
astronomical observations show that dynamically strong magnetic 
fields are common even in young galaxies****. This excellent agreement 
between our experiment and current theories of magnetic-field 
generation in the early Universe opens up the opportunity to invest- 
igate other effects that may influence the generation of seed fields. 
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Light-cone-like spreading of correlations in a 
quantum many-body system 


Marc Cheneau!, Peter Barmettler”, Dario Poletti’, Manuel Endres!, Peter Schauf', Takeshi Fukuhara’, Christian Gross’, 


Immanuel Bloch’, Corinna Kollath?’* & Stefan Kuhr’> 


In relativistic quantum field theory, information propagation is 
bounded by the speed of light. No such limit exists in the non- 
relativistic case, although in real physical systems, short-range 
interactions may be expected to restrict the propagation of 
information to finite velocities. The question of how fast correla- 
tions can spread in quantum many-body systems has been long 
studied’. The existence of a maximal velocity, known as the Lieb- 
Robinson bound, has been shown theoretically to exist in several 
interacting many-body systems (for example, spins on a lattice *)— 
such systems can be regarded as exhibiting an effective light cone that 
bounds the propagation speed of correlations. The existence of 
such a ‘speed of light’ has profound implications for condensed 
matter physics and quantum information, but has not been observed 
experimentally. Here we report the time-resolved detection of 
propagating correlations in an interacting quantum many-body 
system. By quenching a one-dimensional quantum gas in an optical 
lattice, we reveal how quasiparticle pairs transport correlations with 
a finite velocity across the system, resulting in an effective light 
cone for the quantum dynamics. Our results open perspectives for 
understanding the relaxation of closed quantum systems far from 
equilibrium’, and for engineering the efficient quantum channels 
necessary for fast quantum computations’. 

Lieb-Robinson bounds have already found a number of fundamental 
applications*°. For example, they enable a rigorous proof of a long- 
standing conjecture that linked the presence of a spectral gap in a lattice 
system to the exponential decay of correlations in the ground state’?”. 
They also provide fundamental scaling laws for entanglement entropy, 
which is an indicator of the computational cost of simulating strongly 
interacting systems’*. Despite intensive theoretical work, the extent to 
which Lieb-Robinson bounds for interacting spins on a lattice” can be 
generalized remains however an open question’*"*. 

In the context of quantum many-body systems, the existence of a 
Lieb-Robinson bound can be probed by recording the dynamics fol- 
lowing a sudden parameter change (a quench) in the Hamiltonian. In 
that case, a simple picture has been suggested: quantum-entangled 
quasiparticles emerge from the initially highly excited state and 
propagate ballistically’, carrying correlations across the system. 
Ultracold atomic gases offer an ideal test bed for exploring such 
quantum dynamics owing to their almost perfect decoupling from 
the environment and their fast tunability'’. In addition, the recently 
demonstrated technique of single-site imaging in an optical lattice’*” 
offers the resolution and sensitivity necessary to reveal the dynamical 
evolution of a many-body system at the single-particle level. 

Our system consists of ultracold bosonic atoms in an optical lattice 
and is well described by the Bose-Hubbard model*°”’. This model is 
parameterized by two energy scales: the on-site interaction, U, and the 
tunnel coupling between adjacent sites, J. Driven by the competition of 
these two parameters, a quantum phase transition between a super- 
fluid and a Mott-insulating phase occurs in homogeneous systems 
where each lattice site is filled by an integer number of atoms, n. In 


the one-dimensional geometry considered here, the critical point of 
this transition is located at (U/J), ~ 3.4 (ref. 22). We observed the time 
evolution of spatial correlations after a fast decrease of the effective 
interaction strength U/J, from an initial value deep in the Mott- 
insulating regime, with filling n = 1, to a final value closer to the critical 
point (Fig. 1a). After such a quench, the initial many-body state | Y%) is 
highly excited and acts as a source of quasiparticles. In order to 
elucidate the nature and the dynamics of these quasiparticles, we have 
developed an analytical model in which the occupancy of each lattice 
site is restricted to n = 0, 1 or 2 (Supplementary Information). For 
large interaction strengths, the quasiparticles consist of either an excess 
particle (“doublon’) or a hole (‘holon’) on top of the unity-filling back- 
ground. The quasiparticles inherit the bosonic nature of the atoms, but 
they can be turned into fermions (fermionized) using a Jordan- 
Wigner transformation. This allows us to partially eliminate the 
non-physical states in which a lattice site would be occupied by two 
quasiparticles. To first order in J/U, we then find that the many-body 
state at time f after the quench reads: 
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‘ (1) 
E — eT alt) +C-8)K/8 ath’ |W) 


a Quench 
@/ \e/ \e/ \e/ \e/ \e/ \e/ \e ¥ 
Position 
b 
e/ \e/ \@& e@/ \e/ \e \e@ 
© @-@ 
—E 
E| \e e/ \e @e/ \e/ 6-0 


Figure 1 | Spreading of correlations in a quenched atomic Mott insulator. 
a, A one-dimensional ultracold gas of bosonic atoms (black balls) in an optical 
lattice is initially prepared deep in the Mott-insulating phase with unity filling. 
The lattice depth is then abruptly lowered (arrow at right), bringing the system 
out of equilibrium. b, Following the quench, entangled quasiparticle pairs 
emerge at all sites. Each of these pairs consists ofa doublon and a holon (red and 
blue ball, respectively; see key at right) on top of the unity-filling background 
that propagate ballistically in opposite directions. It follows that a correlation in 
the parity of the site occupancy builds up at time t between any pair of sites 
separated by a distance d= vt, where v is the relative velocity of the doublons 
and holons. 
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The operators di and ht, create a doublon with momentum k and 
energy éq(k), and a holon with momentum -k and energy &(—k), 
respectively. k belongs to the first Brillouin zone and aja, is the lattice 
period. Quasiparticles thus emerge at any site in the form of entangled 
pairs, consisting of a doublon and a holon with opposite momenta. 
Some of these pairs are bound on nearest-neighbour sites while the 
others form wave packets, owing to their peaked momentum distri- 
bution. The wave packets propagate in opposite directions with a 
relative group velocity v determined by the dispersion relation ¢q(k) + 
é,(—k) (Fig. 1b). The propagation of quasiparticle pairs is reflected in 
the two-point parity correlation functions”: 


Cat) = (5(8)5}+48)) — (0) Beal’) (2) 
where j generally labels the lattice sites and d denotes the distance 


between the two sites considered. The operator 3;(t) =e [s()—7] mea- 
sures the parity of the occupation number 7i(t). It yields +1 in the 
absence of quasiparticles (odd occupancy) and —1 if a quasiparticle is 
present (even occupancy). Because the initial state is close to a Fock 
state with one atom per lattice site, we expect C,(t = 0) ~ 0. After the 
quench, the propagation of quasiparticle pairs with relative velocity v 
results in a positive correlation between any pair of sites separated by a 
distance d = vt. 

The experimental sequence started with the preparation of a two- 
dimensional degenerate gas of *’Rb confined in a single antinode of a 
vertical optical lattice”* (z axis, aja, = 532 nm). The system was then 
divided into about 10 decoupled one-dimensional chains by adding a 
second optical lattice along the y axis and by setting both lattice depths 
to 20.0(5) E, (the number in parenthesis denotes the uncertainty of the 
last digit; see Methods), where E,=(2nh)*/ (8maj,,) is the recoil 
energy of the lattice and m the atomic mass of *’Rb. The effective 
interaction strength along the chains was tuned via a third optical 
lattice along the x axis. The number of atoms per chain ranged between 
10 and 18, resulting in a lattice filling m=1 in the Mott-insulating 
domain. The initial state was prepared by adiabatically increasing 
the x-lattice depth until the interaction strength reached a value of 
(U/J)o = 40(2). At this point, we measured the temperature to be 
T ~0.1U/kg (kg is the Boltzmann constant) following the method 
described in ref. 19. We then brought the system out of equilibrium 
by lowering the lattice depth, typically within 100 1s, which is fast 
compared to the inverse tunnel coupling //J, but still adiabatic with 
respect to transitions to higher Bloch bands. The final lattice depths 
were in the Mott-insulating regime, close to the critical point. After a 
variable evolution time, we ‘froze’ the density distribution of the many- 
body state by rapidly raising the lattice depth in all directions to ~80 
E,. Finally, the atoms were detected by fluorescence imaging using a 
microscope objective with a resolution of the order of the lattice spacing 
and a reconstruction algorithm extracted the occupation number at 
each lattice site’’. Because inelastic light-assisted collisions during the 
imaging led to a rapid loss of atom pairs, we directly detected the parity 
of the occupation number. 

Our experimental results for the time evolution of the two-point 
parity correlations after a quench to U/J = 9.0(3) show a clear positive 
signal propagating with increasing time to larger distances d (Fig. 2). In 
addition, the propagation velocity of the correlation signal is constant 
over the range 2 = d = 6 (Fig. 2 inset), demonstrating that the correla- 
tions spread along an effective light cone. We also found similar 
dynamics for quenches to U/J=5.0(2) and 7.0(3) (Supplementary 
Fig. 1). We note that the observed signal cannot be attributed to a 
simple density wave because such an excitation would result in 
(384.4) = (3) (Bi4a)- We compared the experimental results to 
numerical simulations of an infinite, homogeneous system at T' = 0 
using the adaptive time-dependent density matrix renormalization 
group algorithm”. In the simulation, the initial and final interaction 
strengths were fixed at the experimentally determined values and the 
quench was considered to be instantaneous, at t=0. We found 
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Figure 2 | Time evolution of the two-point parity correlations. After the 
quench, a positive correlation signal propagates with increasing time to larger 
distances. Main figure, the experimental values for a quench from U/J = 40 to 
U/J = 9.0 (open circles) are in good agreement with the corresponding 
numerical simulation for an infinite, homogeneous system at zero temperature 
(continuous green line). Our analytical model (dashed black line) also 
qualitatively reproduces the observed dynamics. Inset, experimental data 
displayed as a colour map (colour scale at right), revealing the propagation of 
the correlation signal with a well defined velocity. The experimental values 
result from the average over the central N sites of more than 1,000 chains, where 
N equals 80% of the length of each chain. Error bars, s.d. 


remarkable agreement between the experiment and theory over all 
explored distances and times, despite the finite temperature and the 
harmonic confinement with frequency v = 68(1) Hz that characterize 
the experimental system. The observed dynamics is also qualitatively 
reproduced by our analytical model for U/J = 9.0. For lower values of 
U/J, however, the model breaks down owing to the increasing number of 
quasiparticles. 

We extracted the propagation velocity v from the time of the cor- 
relation peak as a function of the distance d (Fig. 3a). A linear fit 
restricted to 2=d <6 yields vh/(Jajat) = 5.0(2), 5.6(5) and 5.0(2) for 
U/] = 5.0(2), 7.0(3) and 9.0(3), respectively. The points for d = 1 were 
excluded from the fit, as they result from the interference between 
propagating and bound quasiparticle pairs (see equation (1)). A com- 
parison of the experimental velocities with those obtained from 
numerical simulations (Fig. 3b) shows agreement within the error bars. 
The measured velocities can also be compared with two limiting cases: 
on the one hand, they are significantly larger than the spreading 
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Figure 3 | Propagation velocity. a, Determination of the propagation velocity 
for the quenches to U/J = 5.0 (open circles, dark blue), 7.0 (medium blue) and 
9.0 (light blue). The time of the maximum of the correlation signal is obtained 
from fits to the traces C(t) for distances d= 1 to 6. Error bars, the 68% 
confidence interval of these fits. We then extract the propagation velocities 
from weighted linear fits restricted to 2 = d = 6 (lines). The data for U/J = 5.0 
and 7.0 have been offset horizontally for clarity. b, Comparison of the 
experimental velocities (open circles) at different interaction strengths to those 
obtained from numerical simulations for an infinite, homogeneous system at 
zero temperature (green shaded area). The shaded area and the vertical error 
bars denote the 68% confidence interval of the fit. The horizontal error bars 
represent the uncertainty due to the calibration of the lattice depth. The black 
line (top right) corresponds to the bound v,,,x predicted by our effective model 
(the fading indicates the breakdown of this model). The arrows mark the 
maximum velocity expected in the non-interacting case (left) and the 
asymptotic value derived from our model when U/J— ~& (right). 


velocity of non-interacting particles, v=4 Jaja/h, and twice the 
velocity of sound in the superfluid phase’; on the other hand, they 
remain below the maximum velocity predicted by our analytical 
model, Vinax ~ (6Jajat/2)[1 — 16J7/(9U*)], which can be interpreted as 
a Lieb-Robinson bound (Fig. 3b). In the limit U/J—~, this bound 
corresponds to doublons and holons propagating with respective 
group velocities 4 Jaa,/h and 2 Jaj,,/h. The higher velocity of doublons 
simply reflects their Bose-enhanced tunnel coupling. 

We have presented the first (to our knowledge) experimental obser- 
vation of an effective light cone for the spreading of correlations in an 
interacting quantum many-body system. Although the observed 
dynamics can be understood within a fermionic quasiparticle picture 
that is valid deep in the Mott-insulating regime, we note that our 
experimental data cover a region close to the critical point, for which 
only ab initio numerical simulations are available so far’. Our work 
opens interesting perspectives, such as revealing the entanglement 
carried by the quasiparticle pairs or investigating the quantum 
dynamics in higher dimensions, where little is known about Lieb- 
Robinson bounds and the scaling of entanglement. For example, the 
experiment can be extended to study correlation propagation in two 
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dimensions, where existing numerical and analytical approaches suffer 
from severe limitations. Furthermore, the production rate of excitations 
and the formation of domains when tuning the effective interaction 
strength slowly across the critical point can be investigated, thereby 
exploring a quantum analogue to the Kibble-Zurek mechanism*”””*. 


METHODS SUMMARY 


We calibrated the lattice depths by performing amplitude modulation spectroscopy 
of the transition between the zeroth and second Bloch band in a one-dimensional 
degenerate gas for the x and y axes, and in a two-dimensional degenerate gas for the 
z axis. We estimate the calibration uncertainty to be 1-2%. 

The quench was performed by lowering the lattice depth along the x axis in an 
exponential way (Supplementary Information). Numerical simulations showed 
that the origin t = 0 of the time evolution can be defined as the moment when 
the effective interaction strength reaches the value U/J ~ 17. We used the same 
phenomenological criterion to locate the moment at which the dynamics stops 
when raising the lattice depth to ~80 E,. 

We extracted the time of the maximum of the correlation signal as a function of 
the distance, for both the experiment and the theory, by fitting an offset-free 
Gaussian profile to the traces C,(t). For the numerical data, we filtered out fre- 
quency components above 3 J/h before the fit in order to isolate the envelope of the 
signal. For the experimental data, we fixed the width of the Gaussian profile to the 
value obtained from the numerical data, keeping only the amplitude and time as 
free parameters. 

The numerical simulations relied on matrix product states” to represent infinite 
homogenous systems”’. Initial states were obtained using the DMRG (density 
matrix renormalization group) algorithm. For the time evolution, we used a 
second-order Suzuki-Trotter decomposition™*”*. We achieved quasi-exact results 
on the relevant timescale tJ/h ~ 2 by choosing a small enough Trotter time step 
(~ 0.002) and retaining a few thousand states. 
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Since the invention of the laser more than 50 years ago, scientists 
have striven to achieve amplification on atomic transitions of 
increasingly shorter wavelength'’. The introduction of X-ray 
free-electron lasers*'° makes it possible to pump new atomic 
X-ray lasers'’* with ultrashort pulse duration, extreme spectral 
brightness and full temporal coherence. Here we describe the 
implementation of an X-ray laser in the kiloelectronvolt energy 
regime, based on atomic population inversion and driven by rapid 
K-shell photo-ionization using pulses from an X-ray free-electron 
laser. We established a population inversion of the Ka transition in 
singly ionized neon“ at 1.46 nanometres (corresponding to a 
photon energy of 849 electronvolts) in an elongated plasma column 
created by irradiation of a gas medium. We observed strong amp- 
lified spontaneous emission from the end of the excited plasma. 
This resulted in femtosecond-duration, high-intensity X-ray pulses 
of much shorter wavelength and greater brilliance than achieved 
with previous atomic X-ray lasers. Moreover, this scheme provides 
greatly increased wavelength stability, monochromaticity and 
improved temporal coherence by comparison with present-day 
X-ray free-electron lasers. The atomic X-ray lasers realized here 
may be useful for high-resolution spectroscopy and nonlinear 
X-ray studies. 

The emergence of short-wavelength X-ray free-electron lasers*'° 
(XFELs), delivering femtosecond X-ray pulses that exceed the peak 
brilliance of synchrotron sources by many orders of magnitude, has 
potential to open a pathway to studying nonlinear interactions of 
X-rays with matter'**". Especially appealing are ideas of transferring 
nonlinear spectroscopic techniques to the X-ray regime, to study 
photoinduced charge transfer, nuclear dynamics, chemical reactions, 
valence wavepacket propagation in molecules or phase transitions in 
solids at high temporal and spatial resolution. Nonlinear techniques to 
study these effects, such as X-ray stimulated coherent Raman spectro- 
scopy’’”° or four-wave mixing’*’, would, however, require ultrashort 
(preferably subfemtosecond), temporally coherent, synchronized two- 
colour X-ray sources. Present-day XFELs are based on self-amplified 
spontaneous emission (SASE) and start from noise. Although they are 
of unprecedented brightness, the pulses of SASE free-electron lasers 
have a fluctuating spectrum and limited temporal coherence”. A way 
to improve SASE pulses and to create a synchronized two-colour 
source in the X-ray regime is to use a SASE pulse to pump an atomic 
X-ray laser (XRL)'*”. 

Soft-X-ray lasers were first realized in 1984 at wavelengths near 
20 nm (refs 2,3). In the soft-X-ray/vacuum-ultraviolet regime, popu- 
lation inversion is typically achieved by collisional excitation or recom- 
bination into excited states of highly ionized atoms in a hot, dense 
plasma*”. The first XRL in the kiloelectronvolt regime was proposed in 
1967 and is based on establishing a population inversion by rapid 
photo-ionization of an inner-shell electron’. In 1983 this scheme 
was demonstrated at optical wavlelengths”, and a few years later a 


similar scheme was demonstrated in the vacuum ultraviolet using 
Auger-decay pumping following inner-shell photo-ionization™. The 
photo-ionization scheme was extensively studied in theory for differ- 
ent gain materials and pump sources in the X-ray regime’*’*7’. 
Notably, detailed calculations for the neon Ko transition pumped by 
laser-produced plasma X-rays were presented in 1991'*. However, the 
lack of sufficiently fast and intense X-ray sources has so far precluded 
the realization of the photo-ionization scheme in the X-ray regime. 
More recently, calculations for neon pumped by XFEL radiation, 
which served as the design basis for this experiment, have been 
presented’’, 

When irradiated by X-rays slightly higher in energy than the ioniza- 
tion edge of an inner electronic shell, photo-ionization of an atom 
proceeds from the inside out, creating short-lived core-excited ions, 
which subsequently relax by means of Auger or radiative decay. At 
intensities achievable with XFELs, inner-shell photo-ionization happens 
on a timescale comparable to the lifetime of core-excited states, so that 
states of high ionic charge can be reached within a few femtoseconds and 
double-core-excited ‘hollow ions can be created’*. Moreover, rapid 
photo-ionization results in short-lived population inversions, which 
can be exploited for lasing from the valence to the core level. 

In our experiment, which was carried out at the Linac Coherent 
Light Source® (LCLS) of the SLAC National Accelerator Laboratory, 
XFEL pulses of 960-eV photon energy were focused into a volume of 
neon gas at ~500 torr to spot sizes of 1-2-u1m radius, to create a long, 
narrow column of transiently core-excited ions (Fig. 1). XFEL pulse 
energies on target were between 0.02 and 0.27 mJ, and the pulse dura- 
tion was 40-80 fs. This resulted in intensities up to 2X 10’? Wcm * 
and an inverse photo-ionization rate of ~4 fs, which is comparable to 
the Auger lifetime of the 1s core hole, namely 2.4 fs. Although the decay 
of the core-excited state is dominated by the Auger process, there is a 
small probability (1.8%) that the ion will undergo a spontaneous radi- 
ative decay, emitting a photon with an energy of 849 eV. Photons that 
are spontaneously emitted near the front end of the plasma column 
generate exponentially amplified stimulated emission along the direction 
of the XFEL pulse, which prepares atoms in the excited state, just as the 
atomic line radiation from previously excited atoms reaches them. In 
traditional plasma-based XRLs, the width of the transition is deter- 
mined by Doppler, lifetime and collisional broadening. In the present 
scheme, the density is low and the plasma remains cold, with the result 
that the transition is mainly lifetime-broadened by Auger decay. The 
line profile of the spontaneous emission before strong amplification by 
stimulated emission is therefore Lorentzian with a width of 0.27 eV. 

A grazing-incidence grating spectrometer with a CCD detector was 
used to monitor the spectrum of the transmitted XFEL pump source 
and the atomic XRL radiation. The raw single-shot image shown in 
Fig. 1b provides strong evidence of lasing. The relatively broad line 
(Fig. 1b, bottom) is the transmitted XFEL pulse. The narrow atomic 
line radiation is visible at 849 eV, with peak intensities one order of 


1Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, California 94551, USA. ?Colorado State University and NSF Engineering Research Center for Extreme Ultraviolet Science and 
Technology, Fort Collins, Colorado 80523, USA. 3LCLS, SLAC National Accelerator Laboratory, 2575 Sand Hill Road, Menlo Park, California 94025, USA. +Present address: Max Planck Advanced Study 
Group, Center for Free-Electron Laser Science, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany. 
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Figure 1 | Experimental scheme. a, The XFEL beam is focused into a gas cell 
filled with neon to a focus spot of radius 1-2 jm. A flat-field X-ray grating 

spectrometer was positioned ~4 m from the interaction region. b, The charge- 
coupled-device (CCD) image of the transmitted XFEL pump (bottom) and the 
XRL (top). ¢, Level scheme. Population inversion of the 1s'2s”2p°-to-1s°2s"2p” 


magnitude higher than that of the strongly attenuated XFEL. 
Fluorescence would result in ~50 photons entering the spectrometer 
set-up, which is undetectable in the current configuration. Figure 2 
shows line-outs of three different single-shot spectra for an XFEL 
energy of ~0.25 mJ. By contrast with the broad (full-width at half- 
maximum (FWHM) of 8 eV) and jittering (FWHM of 14 eV) spectra 
of the XFEL pulses, the atomic line has a reproducible spectral shape 
with narrow width and fixed centroid. The measured FWHM, of 2 eV, 
is limited by the resolution of the spectrometer. 

To demonstrate exponential amplification on the atomic line trans- 
ition, we varied the incoming XFEL energy. To average over the large 
statistical fluctuations in the XRL output, we gathered a set of 950 
spectral images in a series of ten-shot integrations. Figure 3 shows 
the averaged number of photons in the neon K~ line for each of the 
images as a function of the averaged pulse energy. Doubling the XFEL 
energy from 0.12 to 0.24 mJ results in an average increase in the XRL 
line energy by four orders of magnitude. The XRL pulse energy distri- 
bution is relatively broad at a given pump energy, reflecting the large 
shot-to-shot variation of the gain due to the stochastic properties of the 
SASE pulse. Also shown in Fig. 3 is a comparison to a one-dimen- 
sional, time-dependent, self-consistent gain model'* (see also 
Supplementary Information). There is good agreement between the 
calculated and measured energies in the neon K« line and in the 
transmitted XFEL pump (Supplementary Fig. 6). 
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Figure 2 | Single-shot spectra of the atomic XRL line and transmitted XFEL 
pump. Horizontally integrated spectra for three shots with similar pulse 
energies, but slightly varying XFEL photon energies. The neon gas attenuates 
the XFEL and XRL lines by factors of 2.5 X 10° and 2, respectively. The neon Ka 
line is centred at 849 eV and has a FWHM of 2 eV limited by instrument 
resolution. The neon Kz line shows satellite peaks on both sides of the main 
peak. Their intensities closely follow that of the main peak and are most 
probably ghost lines from the spectrometer grating. 
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transition is created by K-shell photo-ionization of neutral neon. The Auger 
decay time of the inverted state (2.4 fs) dominates the kinetics of the system in 
the small-signal-gain regime. The lower lasing state is depleted by K-shell 
photo-ionization. 


To estimate the influence of the fluctuating temporal profiles of the 
XFEL on the gain, we calculated the small-signal gain coefficient for an 
ensemble of 1,000 numerically generated SASE pulses (Fig. 4). The gain 
for this self-terminating scheme is characterized by a rise time of 5 fs 
and a peak of 64cm '. The gain duration has a FWHM of 20 fs and is 
determined by the convolution of the rate to deplete the neutral atoms 
with the exponential Auger decay. The variation in the peak gain due to 
the SASE pump is considerable; the standard deviation is 30%. 

The traditional method of determining the gain-length product 
(GL) is by measuring the output energy versus length. We estimate 
GL by comparing the measured value of the output energy with a 
simple theoretical estimate (Supplementary Information). For the single 
shot with the highest XRL output, 1.1+0.4\J, we estimate that 
GL = 19.2-21.3, whereas for average shots with 0.24-mJ XFEL energy 
we deduce GL = 17-19.3. This is consistent with GL = 19, the result 
calculated using our one-dimensional model. Owing to strong absorp- 
tion of the XFEL, the gain coefficient decreases as a function of pro- 
pagation depth, with a calculated decay length of 0.28cm. This is 
somewhat longer than the expected attenuation length in neutral neon, 
of 0.22 cm, owing to bleaching. Assuming a gain region 0.28 cm in 
length, we estimate an effective gain coefficient of 61-70cm ' at 
0.24-mJ XFEL energy. Saturation of the exponential amplification 
cannot be discerned in the experimental data presented in Fig. 3. Our 


1x10", 
1 x 10°F 
1x 10°F 
1 x107F 


1x 10°F 


Number of transmitted photons 


1x 105F e736 


0.25 0.3 


pee sie Sas fe ge gg 
ued 0.05 0.1 0.15 0.2 


Incoming XFEL pulse energy (mJ) 


Figure 3 | Dependence of the XRL output on pump power. Average number 
of transmitted photons of the neon K« line for ten consecutive XFEL shots, as a 
function of the average XFEL pulse energy on target (s.d., +0.014 mJ). We 
assumed a beamline transmission of 18%. The determined number of Ka 
photons has an error of ~30%. The small linear signal below a pulse energy of 
0.15 mJ results from spectral tails of the XFEL reaching into the 849-eV energy 
region of the XRL line. Results of a one-dimensional model are shown in red, 
assuming a density of 1.6 X 10!’ atoms cm’ °, an interaction length of 1.8cm, a 
pulse duration of 40 fs and a focal radius of 2 um. 
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Figure 4 | Simulation of the gain. Temporal profile of the average gain (error 
bars, s.d.) at a density of 1.6 X 10° atoms cm ° for an ensemble of 1,000 SASE 
pulses of 40-fs pulse duration, an energy of 0.24 mJ and a coherence time of 
0.3 fs. Also shown is the temporal profile of the gain as determined by the self- 
consistent gain calculations at penetration depths of z = 0.02 cm (blue) and 
0.28 cm (green) for a flat-top pulse with a 0.5-fs Gaussian ramp on. Inset, 
integrated gain-length product as a function of the penetration depth in the 
medium. 


simulations, however, suggest that we are close to the onset of satura- 
tion, which according to our model is reached at GL = 22. In traditional 
XRLs, saturation sets in at far lower values, of GL ~ 15 (ref. 5). In our 
scheme, the small angular divergence, the fast upper-state decay and the 
absence of refraction all contribute to the higher value of GL needed to 
reach saturation. 

The measured output parameters of the neon XRL are a maximum 
single-shot energy of 1.1 + 0.4 J and an angular divergence of 1 mrad 
(see spatial beam intensity profile in Supplementary Fig. 5). This 
energy corresponds to ~8 X 10” photons in the neon Ke line and an 
energy conversion efficiency of 4 X 10 *. The geometry of the gain 
medium (radius, 1-2 1m; length, 0.28 cm) results in a Fresnel number 
of ~1, such that the XRL is expected to be transversely coherent*. Our 
model predicts an average pulse duration of 5-fs FWHM. The XRL, a 
gain-swept amplifier characterized by a fast build-up of the population 
inversion and a short upper-state lifetime, is expected to be nearly 
transform limited and longitudinally coherent”. We estimate the spec- 
tral width to be 0.27 eV, resulting in a relative bandwidth of 0.03%, 
determined by Fourier transform of the square root of the temporal 
flux profile. To determine the spectral, temporal and coherence prop- 
erties accurately, a Maxwell-Bloch approach is required. With the 
above parameters, we estimate the peak brilliance of the XRL to be 
~4X 10° photonss ‘mrad *mm ° within a relative spectral band- 
width of 0.1%. By comparison, the LCLS has a brilliance of ~3 X 10°" 
at a photon energy of 1 keV (ref. 8) and can be operated to produce 
pulses with durations of less than or equal to 10 fs. Considering the 
absolute bandwidth, the XRL has a spectral brightness only a factor of 
30 smaller than that of the LCLS. Notably, the present inner-shell XRL 
is the shortest-wavelength atomic XRL demonstrated so far and its 
peak brilliance is greater by two to three orders of magnitude than 
those of laser-heated XUV plasma lasers”°. 

The XRL photo-ionization scheme can be extended in several ways. 
The transverse coherence and the brilliance could be increased by 
lowering the gas density, thereby lengthening the gain region and 
reducing angular divergence. The output energy of the XRL could be 
increased significantly by a modest increase in the XFEL energy, as 
indicated by the theoretical curve in Fig. 3. By increasing the photon 
energy of the XFEL above the K edges of higher charge states, it should 
become possible to use hydrogen and helium-like neon as lasing ions’’. 
This would result in lines of higher energies and of even narrower 
bandwidth (Aw/w~ 10°), owing to the extended lifetime of the 
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upper state. An increase in the photon flux by a factor of ten and longer 
pulse durations, in the range of 150-200 fs, would be required to access 
these lines. The XRL photo-ionization scheme can be transferred to 
other elements and, thereby, to shorter or longer wavelengths. Lasing 
on a Ke transition seems to be most promising for elements with 
atomic number Z = 20 (refs 14,25). According to a simplified gain 
model, the gain-length product scales like GLocI/”” as wavelength 
(A) decreases (where I denotes the pump intensity). A numerical study 
published before XFELs were available showed that lasing in the Ka 
transition in solid sulphur at 2.37 keV could be established with X-ray 
pulses nowadays achievable with XFELs**. To achieve a gain-length 
product comparable to that of neon, an increase in the pump intensity 
by a factor of approximately ten is necessary, which could be achieved 
by tighter focusing. Another extension is to consider resonant excita- 
tion as a pumping mechanism’’. This scheme, which is based on 
amplification of self-stimulated X-ray Raman scattering, has the 
potential of creating phase correlated two-colour X-ray/vacuum- 
ultraviolet sources suitable for coherent pump-probe applications. 

The demonstrated XRL scheme enhances the capability of present- 
day XFEL sources. By contrast with the SASE XFELs, the atomic XRL is 
expected to be longitudinally coherent and has a stable wavelength 
centroid, that is, a highly reproducible spectrum and a much narrower 
bandwidth. These advantages could be exploited in applications of 
photoelectron spectroscopy or inelastic X-ray scattering. Moreover, 
the scheme directly provides a synchronized, that is, time-jitter free, 
two-colour source in the X-ray regime. Both the transmitted XFEL 
radiation and the atomic XRL radiation have of the order of 10°-10'° 
photons per pulse and ultrashort pulse durations of 40 and 5 fs, 
respectively. The photon energy of the XFEL in this two-colour source 
is tunable, as long as the photon energy is greater than the K-edge 
energy of the gain medium. This scheme hence opens the way to 
two-colour pump-probe studies in the X-ray regime. Intriguing new 
directions realizable with this kind of source would for example be 
stimulated inelastic X-ray scattering for applications in solid-state and 
surface physics or time-resolved molecular spectroscopy by stimulated 
X-ray Raman scattering”. 


METHODS SUMMARY 


LCLS X-ray pulses with ~960-eV photon energy were focused to an estimated 
focus spot of ~1-2-1m radius in a gas cell of 1.4-cm length filled with neon at a 
pressure of 500 torr. The XFEL pulse energy was varied with a gas attenuator 
situated upstream and monitored by two upstream pulse-energy detectors. A 
flat-field grating spectrometer was positioned 4m downstream of the gas cell to 
record the spectra of both the transmitted XFEL pump and the XRL line. The 
spectra were recorded with a conventional 20-,1m-pixel back-illuminated X-ray 
CCD camera. The resolution of the spectrometer is ~2 eV at 1-keV photon energy. 
A triggered shutter allowed for both single-shot and integrated ten-shot measure- 
ments, at a repetition rate of the LCLS between 10 and 60 Hz. To estimate the 
signal and gain parameters of the XRL, we performed self-consistent gain calcula- 
tions. The model is based on a set of atomic rate equations, determining the 
populations of a total of 63 configuration states of neon during interaction with 
an X-ray pulse, coupled to equations describing the propagation, absorption and 
amplification of the X-ray flux in a simplified one-dimensional geometry. The 
processes taken into account are photo-ionization of the valence and core shells, 
Auger decay, spontaneous and stimulated radiative decay, and absorption. We 
consider the forward-propagating X-ray flux at two discrete energies and assume a 
dispersion-free medium. Calculations for small signal gain were performed for a 
stochastic ensemble of SASE pulses, created by a Monte Carlo method. For com- 
parison with the ten-shot data, we assumed flat-top pulses, representing the average 
temporal structure of the SASE ensemble. 
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Self-assembly provides an attractive route to functional organic mate- 
rials, with properties and hence performance depending sensitively 
on the organization of the molecular building blocks'*. Molecular 
organization is a direct consequence of the pathways involved in the 
supramolecular assembly process, which is more amenable to detailed 
study when using one-dimensional systems. In the case of protein 
fibrils, formation and growth have been attributed to complex 
aggregation pathways’ * that go beyond traditional concepts of homo- 
geneous’ "' and secondary’* “* nucleation events. The self-assembly of 
synthetic supramolecular polymers has also been studied and even 
modulated’*""*, but our quantitative understanding of the processes 
involved remains limited. Here we report time-resolved observations 
of the formation of supramolecular polymers from n-conjugated oli- 
gomers. Our kinetic experiments show the presence of a kinetically 
favoured metastable assembly that forms quickly but then transforms 
into the thermodynamically favoured form. Quantitative insight into 
the kinetic experiments was obtained from kinetic model calculations, 
which revealed two parallel and competing pathways leading to 
assemblies with opposite helicity. These insights prompt us to use a 
chiral tartaric acid as an auxiliary to change the thermodynamic pref- 
erence of the assembly process’. We find that we can force aggrega- 
tion completely down the kinetically favoured pathway so that, on 
removal of the auxiliary, we obtain only metastable assemblies. 

The x-conjugated oligomer S-chiral oligo(p-phenylenevinylene) 
(SOPV, Fig 1a) serves as a functional material in a variety of organic 
electronic devices, with performance critically dependent on the 
material’s morphology”. This morphology is determined by the 
assembly mechanisms that transform the molecular dissolved 
monomers into the materials used in devices. The hydrogen-bonded 
dimers of SOPV self-assemble in apolar solvents via m-1 interactions 
into one-dimensional helical stacks, and earlier circular dichroism (CD) 
and ultraviolet-visual spectroscopy studies under thermodynamic 
control revealed exclusive formation of left-handed M-type helical 
aggregates (M-SOPV) through a nucleated growth mechanism’. 
In practice, however, the assembly process is often under kinetic 
control—prompting us to examine in detail the supramolecular 
polymerization process under non-equilibrium conditions. 

Upon rapid quenching of SOPV from the molecularly dissolved 
state to 273 K, we observed the formation of a mixture of M-SOPV 
and aggregates with opposite helicity, as evidenced by the opposite sign 
of the bisignated Cotton effect (Fig. 1b). At 298 K, these right-handed 
P-type aggregates (P-SOPV) slowly converted into thermodynamically 
stable M-SOPV aggregates (Supplementary Fig. 1). The observation of 
metastable P-SOPV aggregates indicated that the supramolecular 
polymerization of SOPV involves two different aggregation pathways, 
which we termed the on-pathway (leading to M-SOPV) and off- 
pathway (leading to P-SOPV) (Fig. Ic). 

To study the aggregation mechanism of SOPV and quantify the self- 
assembly pathways under kinetic control, we conducted stopped-flow 


experiments in which a concentrated solution of molecularly dissolved 
SOPV in chloroform was mixed with an excess of methylcyclohexane 
(MCH) to initiate self-assembly. The subsequent formation of helical 
SOPV aggregated in MCH was probed using CD spectroscopy 
(Supplementary Discussion 1). These kinetic experiments were con- 
ducted at 293 K and 308 K, with different concentrations of SOPV. At 
293K and the lowest SOPV concentrations, the rate of aggregate 
formation initially increased with time, characteristic of a lag phase 
(Fig. 2a, c). The time-dependent CD signal was always negative under 
these conditions, suggesting the direct formation of thermodynamically 
stable on-pathway M-SOPV aggregates. At higher concentrations 
(29 tM) a positive CD signal appeared in the initial stages of the 
assembly process and then developed into a negative CD signal at later 
times, suggesting the initial formation of off-pathway P-SOPV aggre- 
gates that then convert into thermodynamically stable M-SOPV aggre- 
gates. Remarkably, the time at which 50% of the aggregation process 
was completed (t— 50) is longer at 151M than at 10 uM (Fig. 2d). 
Analogous kinetic studies at 308 K also revealed the presence of a lag 
phase in experiments at the lowest concentrations (Fig. 2b, e) and an 
inverted dependence of t — 50 on concentration, although the shortest 
t — 50 time shifts to a higher SOPV concentration (20 UM; Fig. 2d). 
Detailed analyses unambiguously prove that both the on- and off- 
pathway are formed via a homogeneously nucleated growth mech- 
anism (Supplementary Figs 5 and 6). 

To rationalize the experimental aggregation kinetics, we extended 
models known in the field of protein fibrillization®’"” by incorporating 
two competing, nucleated assembly pathways in which prenucleus 
oligomers (oligomer aggregates below the critical nucleation size) 
and helical aggregates change size through monomer association 
and dissociation (Fig. 3a, Supplementary Discussion 2). For the 
M-SOPV aggregates, non-helical prenucleus oligomers change size 
with rate constants for association and dissociation of a and b, respec- 
tively. Once the critical nucleus (with size n) is reached, the helical 
aggregates are in the elongation regime and change size through 
monomer association with the same rate constant a, while dissociation 
proceeds with rate constant c. The steady-state concentration of the 
aggregates in the nucleation phase is determined by the nucleation 
equilibrium constant K, = a/b, while in the elongation phase it is 
determined by the elongation equilibrium constant K, = a/c. The 
nucleated growth of off-pathway P-SOPV aggregates is described 
analogously, with nucleus size n*, rate constants a*, b* and c*, and 
equilibrium constants K,* and K,*. An essential model assumption is 
that the transition from metastable to thermodynamically stable aggre- 
gates occurs via depolymerization of P-SOPV aggregates and sub- 
sequent growth of M-SOPV aggregates. This is justified by the high 
helix reversal penalty (8.1 kgT; where kg is the Boltzmann constant and 
T is temperature), obtained by ‘majority rules’ experiments, which 
rules out intrastack stereomutation as an alternative transition mech- 
anism (Supplementary Fig. 8)”. 
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Figure 1 | Pathway complexity in supramolecular polymerization of SOPV. 
a, Molecular structure of SOPV with S-chiral butoxy side chains. b, CD spectra 
of SOPV in MCH in disassembled state (343 K, black line), thermodynamically 
stable M-SOPV (273 K, blue line) and mixture of M-SOPV and metastable 
P-SOPV (273 K, red line). c, Schematic representation of the aggregation 
pathways of SOPV, including the growth of two competing assemblies. Self- 
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(left-handed) 


Helical aggregates 


assembly of SOPV in apolar solutions is initiated by formation of a quadruply 
hydrogen-bonded dimer. The hydrogen-bonded dimer further self-assembles 
into helical stacks via a nucleation-elongation growth mechanism. Right- 
handed P-helices form quickly but are less stable than the left-handed 
M-helices, which form more slowly. 
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Figure 2 | Experiments under kinetic control. Stopped-flow studies on self- 
assembly of SOPV. a, Concentration-dependent kinetics at 293 K. SOPV 
concentrations (black arrow shows increasing concentration) are colour-coded 
as follows: 5 1M (black), 6.1 1M (red), 6.5 uM (light green), 7.8 UM (dark blue), 
8.7 uM (light blue), 9 uM (pink), 10 uM (yellow), 15 uM (dark green) and 

19 uM (orange). b, Concentration-dependent kinetics at 308 K. SOPV 


concentrations are 10 1M (black), 12.5 uM (red), 15 1M (green), 20 1M (blue) 
and 25 1M (orange). c, Zoom in for initial stages of growth process studied at 
293 K in panel a. d, Time at which 50% of final CD intensity is reached (t — 50) 
versus concentration at 293 K and 308 K. The lines serve to guide the eye. 

e, Zoom in for initial stages of growth process studied at 308 K in panel b. 
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Simulations with this model showed that off-pathway P-SOPV 
aggregates only appear in the initial stages of the self-assembly process 
if sufficient monomers are present and P-SOPV formation is kinetically 
favoured over that of M-SOPV. Specifically, the total monomer con- 
centration (c,.,) has to exceed the critical concentration of P-SOPV 
(Ctot > Ke*'), which depends on the elongation equilibrium constant 
of this species. Moreover, the initial formation of P-SOPV is kinetically 
favoured when the forward rate constant of nucleation and elongation 
is larger for the off-pathway aggregates (a* > a) or if the pre-nucleus 
oligomers of P-SOPV are kinetically more stable (K,* > K,, or a*/ 
b* > a/b) (Supplementary Fig. 9). As a result, off-pathway P-SOPV 
aggregates can only self-assemble at high concentrations and low tem- 
peratures (high K,*), as was observed experimentally. With a* > a, the 
kinetic model successfully described the experimental kinetic data, 
including the initial formation of P-SOPV (Fig. 3b). The calculated 
Gibbs free energy diagram revealed that the P-SOPV nucleus is thermo- 
dynamically less unstable than the M-SOPV nucleus (that is, 
K,,* > K,), whereas M-SOPV is more stable in the elongation phase 
(Fig. 3c). Kinetic studies on protein fibrillation have shown that for the 
initial stages of homogeneously nucleated aggregation involving a 
thermodynamically unstable nucleus, the conversion to the stable 
product is proportional to time squared, ¢” (ref. 9). In our experiments 
with the lowest SOPV concentrations, the changes of the CD signal in 
the lag phases indeed can be described as being proportional to f’, 
demonstrating homogeneous nucleation in the aggregation of SOPV 
(Supplementary Fig. 10). 

To corroborate the proposed involvement of two aggregation 
pathways, we further analysed the concentration dependence of the 
kinetic data and found that the changes in the t— 50 values with 
concentration can only be rationalized by taking off-pathway aggrega- 
tion into account (Fig. 3d, e): the relatively large amount of metastable 
P-type aggregates formed at higher concentrations sequester mono- 
mers and lower their solution concentration, thereby hampering the 
formation of the thermodynamically stable M-type aggregates. Our 
simulations did not show a further decrease of t—50 at higher 


concentrations with reasonable model parameters. Because our 
model only considers monomer addition and dissociation, oligomer 
reactions and fragmentation’*—which become more important at 
higher concentrations—may explain this difference between simulated 
and observed behaviour. Simulations exploring the effect of temper- 
ature showed that the maximum in the aggregation rate shifts to higher 
concentrations and the aggregation rate gets faster (that is, t — 50 
values get lower) on increasing temperature, as was observed experi- 
mentally (Supplementary Fig. 11). 

The discovery of these two different assembly pathways, producing 
structurally different aggregates, encouraged us to attempt to direct the 
assembly of SOPV towards metastable products. To force the assembly 
of SOPV into exclusively P-type aggregates instead of the mixture 
obtained upon fast cooling (see above), we used a two-step non- 
covalent synthetic method (Fig. 4a, b) inspired by previous assembly 
control experiments’’. The earlier work involved achiral oligo(p- 
phenylenevinylene) monomers that assemble into chiral helical archi- 
tectures of equal energy, and showed that the addition of chiral 
auxiliaries transforms an initially racemic mixture of these structures 
so that one of the two possible chiral helical architectures dominates”. 
Here, we used a chiral auxiliary to direct the assembly so that, upon 
removal of the auxiliary, we obtain the thermodynamically unstable 
P-type aggregates. 

In the first step, S-chiral dibenzoyl tartaric acid (DTA) was used to 
dictate its chirality to the SOPV monomer via two-fold hydrogen 
bonding, orthogonal to the SOPV dimerization (Supplementary 
Discussion 3). Aggregates having opposite helicity (P-DTA-SOPV) 
compared to equilibrium conditions (M-SOPV) are formed, as 
demonstrated by the opposite CD spectrum (Fig. 4c). In the second 
step, the complete removal of DTA from the SOPV aggregates by 
aqueous extraction at 273K using ethylene diamine resulted in the 
formation of transiently stable P-SOPV aggregates at this temperature 
(Supplementary Fig. 12). The exclusive formation of P-SOPV is 
demonstrated by the absolute ellipticity of P-SOPV, which equals 
the ellipticity of a solution containing exclusively M-SOPV aggregates 


a Off-pathway metastable Thermodynamically b 
aggregate P stable aggregate OF 
8 ee, 
S ; : E 
Zz Nucleus n*: Be Nucleus n: * = 20 
Py +X => P, M, 1+ XS M, e) 
c -30; 
2 — Kinetic data 293 K, 15 uM 
ae — Simulation with pathway competition model 
= 1 1 1 1 
2 0 1,000 2,000 3,000 
Time (s) 
c d e 
tT f; ‘ 
— Qa be 
£ ol , i " 10 
E€ = : | Concentration DB 
QZ 10 g 0 Concentration Y// 8 M 
roy £ ~ 108 P+M 
— an M 8 
- S es 
2 4 6 8 10 10-7 10° 107 10? 103 10% 10° 105 10+ 


Chain length (no units) 


Figure 3 | Analysis aggregation pathway competition with kinetic model. 
a, Schematic representation of reversible kinetic model for nucleated 
supramolecular polymerization, including a metastable off-pathway and a 
thermodynamically stable on-pathway competing for free monomer. b, Kinetic 
trace of the self-assembly of SOPV and a simulated kinetic trace using the 
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pathway competition model. The inset to b shows a zoom in for the initial stages 
of the growth process. c, Calculated Gibbs free energy diagram for formation of 
P-SOPV and M-SOPV. d, Concentration-dependent simulation of kinetics with 
pathway competition model. e, t — 50 versus SOPV hydrogen-bonded dimer 
concentration obtained from simulated concentration-dependent kinetics. 
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Figure 4 | Controlled formation of exclusively metastable aggregates via a 
two-step non-covalent synthetic methodology. a, Molecular structure of 
S-chiral dibenzoyl tartaric acid. b, Schematic representation of preparation of 
pure P-SOPV via a two-step non-covalent synthetic methodology. c, CD 


under equilibrium conditions. The kinetic lability of P-SOPV is 
demonstrated by annealing at 298K, resulting in a time-dependent 
stereomutation of the CD spectra indicative of a conversion from 
P-SOPV towards M-SOPV aggregates (Fig. 4d). The single isosbestic 
point in these time-dependent CD spectra and their apparent first-order 
kinetics (Supplementary Fig. 13) indeed indicated that two distinct states 
are involved in the conversion process: P-SOPV and M-SOPV. 

Our observations established that the aggregation of SOPV involves 
two competing pathways leading to assemblies with opposite helicity, 
one of which is favoured kinetically and the other thermodynamically. 
We note that kinetically controlled formation of one-dimensional 
supramolecular aggregates has been reported”*”*, but the metastable 
states involved have so far only been probed in terms of their static 
properties (such as morphology or photophysical properties). Our 
kinetic studies, by probing the full aggregation process starting from 
free monomers, reveal that the influence of the metastable state—the 
off-pathway aggregates—on the overall assembly process is mediated 
through the equilibrium with free monomers. This effect is a common 
mechanism for one-dimensional supramolecular systems. We also 
note that almost all organic materials used in functional devices are 
processed starting from solutions of the molecular dissolved compo- 
nents. This means that by influencing the subtleties of the self-assembly 
process generating these materials, through tuning of the on-pathway 
or off-pathway mechanisms, the resulting morphologies could poten- 
tially be controlled to arrive at optimized self-assembled functional 
materials. 


METHODS SUMMARY 


Thermodynamically stable M-SOPV (MCH, 100 1M) is obtained upon slow cool- 
ing from the molecularly dissolved state (60 Kh ') to 273K, and equilibrium 
conditions are verified by time-dependent measurements, showing no changes in 
time. Metastable P-SOPV (MCH, 100 [1M) is obtained upon quenching molecularly 
dissolved SOPV to 273 K (ice bath). Stopped-flow experiments are performed by 
mixing a concentrated SOPV solution in chloroform with MCH (1:50 volume ratio, 
Berger ball mixer, mixing time <10ms). The subsequent formation of helical 
aggregates is probed by a CD spectrometer (466 nm) connected in-line with the 
stopped-flow instrument. The absence of (oligomeric) SOPV aggregates at the start 
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spectra of M-SOPV, P-DTA-SOPV and P-SOPV. d, CD spectra acquired 
during conversion from metastable P-SOPV towards M-SOPV upon annealing 
at 298 K. 


of the experiment is verified by 'H nuclear magnetic resonance (NMR) studies. The 
nucleated aggregation pathway competition model describes the kinetic data with 
parameters n=n*=5; a=2.9X 10*M!s7; K.= 1.52 X 10°M"}, K,/ 
K. = 0.0526, K,*/K, = 1.38, K.*/K. = 0.164 and a*/a = 3.79 (Matlab, ode15s sol- 
ver). The value of K, used is independently assessed by melting experiments. 
Alternatively, models using diffusion-controlled kinetics afford unrealistic values 
of K,/K.< 10*. The Gibbs free energy as a function of chain length i is calculated 
using AG°( — 1) 414 = —RTIn(Ke,o,), where R is the gas constant. To obtain P-DTA- 
SOPV, DTA and SOPV are mixed in chloroform (1:1). Subsequently chloroform is 
evaporated; the residual DTA-SOPV complex is dissolved in MCH (100 1M) on 
heating above the critical elongation temperature. Cooling from the molecularly 
dissolved state to 293 K results in P-DTA-SOPV aggregates. On adding an equal 
volume of aqueous ethyl diamine solution (0.15 M) at 273 K, vigorous mixing of the 
water and MCH phase (1 min) and full separation of both phases, the top MCH 
layer containing metastable P-SOPV is isolated. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Materials. The synthesis of SOPV has been described previously”’. All solvents 
were used as received. 

Instrumentation. CD, linear dichroism and corresponding ultraviolet-visual 
spectra were recorded using a Jasco J-815 CD spectrometer. Sensitivity, response 
time and scanning speed were chosen appropriately. The temperature was con- 
trolled using a Jasco Peltier temperature controller with a range of 263-383 K and 
adjustable temperature slope. Stopped-flow studies were performed using a 
Biologic SFM 400 stopped-flow setup with Berger Ball mixer, Biologic TC 100 cuvet 
(optical path length 1 cm), and Biologic MPS-60 controller unit. The stopped-flow 
cuvet was connected in-line with a Jasco J-815 CD spectrometer. To control the 
temperature of the cuvet and syringes, SFM 400 was connected to a Julabo F12 
temperature controller (ethylene glycol bath with thermostat). "H-NMR spectra 
were recorded at 298 K on a Varian Unita Inova (500 MHz), in CDCI; (Cambridge 
Isotope Laboratories). Fluorescence measurements were performed on an 
Edinburgh Instruments FS920 double-monochromator luminescence spectro- 
meter using a Peltier-cooled red-sensitive photomultiplier. Ultraviolet-visual 
spectra acquired on the complexation of o-cyclodextrin with p-nitrophenolate 
were recorded on a Perkin Elmer L40 photospectrometer. 

Experiments under kinetic control. Thermodynamically stable M-SOPV 
(100 4M) in methylcyclohexane (MCH, spectroscopic grade, Sigma-Aldrich) 
was obtained upon slow cooling (60 Kh ') from the molecularly dissolved state 
(343 K) to 273 K; equilibrium conditions were verified by time-dependent mea- 
surements, showing no further change in time. Metastable P-SOPV (MCH, 
100 1M) was obtained upon quenching molecularly dissolved SOPV from 343 K 
to 273 K (ice bath). The optical path length for the reported spectra was 1 mm. The 
fraction of P-SOPV and M-SOPV obtained upon quenching SOPV (MCH, 
13 1M) from different temperatures was evaluated from the CD spectra by simu- 
lation of a linear combination of the spectra corresponding to a solution consisting 
of pure P-SOPV (obtained via the two-step non-covalent synthetic methodology) 
anda solution containing M-SOPV exclusively. The fraction of free SOPV monomer 
present before quenching was derived from the CD spectra under equilibrium con- 
ditions assuming Pmonomer = 1 —Phetical aggregate. The optical path length of the 
reported spectra for these quenching experiments was 1 cm. 

Stopped-flow experiments. To probe the kinetics of the supramolecular 
polymerization of SOPV, SOPV in chloroform (spectroscopic grade, Sigma- 
Aldrich) was mixed with MCH (50:1; 1,508 ul:30 pl, injection rate 8-9 mls}, 
mixing time <10ms), after which CD was followed in time (A= 466 nm; 
Ad = 1nm; At = 1s; standard sensitivity; high tension voltage adjusted to get a 
direct-current voltage around 1 V). The SOPV concentration and the duration of 
the measurement were adjusted appropriately. The absence of (oligomeric) SOPV 
aggregates at the start of the experiment was verified by 'H-NMR studies. Kinetic 
curves were averaged over multiple stopped-flow experiments as follows. At 293 K: 
5 uM, 9; 6.1 uM, 2; 6.5 uM, 6; 7.8 uM, 1; 8.7 uM, 4; 9 uM, 2; 101M, 9; 15 1M, 3; 
19 1M, 6. At 308K: 10§M, 6; 12.5 1M, 6; 15 uM, 11; 201M, 9; 25M, 6. After 
the measurement, the concentration was verified using CD (466nm) and 
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ultraviolet—visual (438 nm) spectra. To test the mixing in the stopped-flow setup, 
the complexation of «-cyclodextrin and p-nitrophenolate was measured. Stopped- 
flow experiments were performed upon 100:1 mixing of an o-cyclodextrin 
(Across) and a p-nitrophenolate (Merck) solution. Both solutions were buffered 
at pH = 11 (HPO, /PO,>- buffer). Settings: injection volumes (100:1) 2,011 ul:20 
ul, injection rate 7 ml sl, mixing time <10 ms, 1 = 410 nm, A/ = 1nm,/= 1 cm, 
At = 1s, standard sensitivity, high tension voltage = 300 V; data averaged over 5 
injections. 

Aggregation pathway competition kinetics model. The kinetic model describes 
the competing nucleated aggregation pathways of both P-SOPV and M-SOPV via 
a sequence of differential equations. Each differential equation describes monomer 
association and dissociation to the respective aggregate with length i. Assuming 
that for i> N, with N>>n (N= 100), [X;+ 1] = «[Xj], the required number of 
differential equations was reduced to 2(N + 2). The resulting system of differential 
equations is solved using the ode15s solver in Matlab. The aggregation pathway 
competition model describes the kinetic data with parameters n = n* =5; 
a=2.9X10°M's'; K,= 1.52 X 10°M", K,/K, = 0.0526, K,*/K, = 1.38, K.*/ 
K. = 0.164 and a*/a = 3.79. The value of K. used was independently assessed by 
melting experiments. Alternatively, models using diffusion-controlled kinetics 
afforded unrealistic values of K,/K.<10~*. The Gibbs free energy diagram was 
calculated via AG? poi = —RTIn(Keor), with gas constant R, temperature T, 
equilibrium constant K = K,, fori =n and K = K, for i> and assuming the total 
SOPV hydrogen-bonded dimer concentration ¢, as the reference state, whereas 
the standard state was defined” at 1 M. The role of temperature on the aggregation 
pathway competition was rationalized by performing kinetic simulations at dif- 
ferent temperatures. The temperature-dependency of the forward rate constant 
was described by the well-known Arrhenius equation: a = Aexp(—E,¢;/RT), where 
Ais the kinetic prefactor and E,, the activation energy. The temperature-dependency 
of K, was described using the Van’t Hoff equation: K, = exp(—(AH,° - TAS.°)/RT), 
where AH.” and AS.° represent the standard enthalpy and entropy of elongation, 
respectively. The other rate constants are defined via temperature-independent 
ratios. 

Two-step non-covalent synthetic methodology. To obtain P-DTA-SOPV, the 
DTA (provided by Syncom) and SOPV were mixed in chloroform (1:1). 
Subsequently, chloroform was evaporated by purging with nitrogen gas; the residual 
DTA-SOPV complex (red colour) was dissolved in MCH (100 11M) upon sonication 
and heating until the sample turned green, indicating molecular dissolution. Cooling 
from this molecularly dissolved state to 293 K resulted in P-DTA-SOPV aggregates. 
On adding an equal volume of aqueous ethyl diamine solution (0.15 M) at 273 K, 
vigorous mixing of the water and MCH phase (1 min) and full separation of both 
phases, the top MCH layer containing metastable P-SOPV could be isolated. 
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Social networks and cooperation in hunter-gatherers 


Coren L. Apicella’?, Frank W. Marlowe’®, James H. Fowler*° & Nicholas A. Christakis” ®7 


Social networks show striking structural regularities’, and both 
theory and evidence suggest that networks may have facilitated 
the development of large-scale cooperation in humans*”’. Here, 
we characterize the social networks of the Hadza, a population of 
hunter-gatherers in Tanzania®. We show that Hadza networks have 
important properties also seen in modernized social networks, 
including a skewed degree distribution, degree assortativity, trans- 
itivity, reciprocity, geographic decay and homophily. We demon- 
strate that Hadza camps exhibit high between-group and low 
within-group variation in public goods game donations. Network 
ties are also more likely between people who give the same amount, 
and the similarity in cooperative behaviour extends up to two degrees 
of separation. Social distance appears to be as important as genetic 
relatedness and physical proximity in explaining assortativity in 
cooperation. Our results suggest that certain elements of social net- 
work structure may have been present at an early point in human 
history. Also, early humans may have formed ties with both kin and 
non-kin, based in part on their tendency to cooperate. Social net- 
works may thus have contributed to the emergence of cooperation. 

Humans are unusual as a species in the extent to which they form 
longstanding, non-reproductive unions with unrelated individuals— 
that is, we have friends. Cooperation is a defining feature of these friend- 
ships’. Humans also learn from and influence each other, evincing an 
exceptional reliance on cultural transmission’®. These facts contribute to 
the propensity of humans to form social networks, which can range in 
size from dozens to millions of people’. 

Social networks display certain empirical regularities—in settings as 
diverse as villages, schools, and workplaces—in terms of variation in the 
degree distribution (number of social ties), transitivity (the likelihood 
that two of a person’s friends are in turn friends), degree assortativity 
(the tendency of popular people to befriend other popular people), 
reciprocity (the increased likelihood of an outbound tie to be recipro- 
cated with an inbound tie from the same person), and homophily (the 
tendency of similar people to form ties). Some properties (such as a fat- 
tailed degree distribution) may be seen in many contexts (such as 
neuronal, electronic and social networks). Other properties are more 
distinctively social, and may have adaptive significance. For instance, 
degree assortativity may constrain the spread of pathogens’’, high 
transitivity may help reinforce social norms (although it can also reduce 
the flow of new information), and homophily may facilitate collective 
action”. 

However, technological advances (such as in communication, trans- 
portation and agricultural systems), demographic changes (such as in 
population density, inter-group marriage and dispersal), and social 
innovations (such as in formal institutions) have all changed the social 
landscape of humans from that in which they evolved. This raises the 
question of whether features observed in modernized social networks 
are ancient or contemporary in origin. Yet the observed regularities in 
social networks, coupled with the fact that networks can affect diverse 
individual-level outcomes, suggests that natural selection may have 
played a part in the formation of human networks. Indeed, some 
egocentric network attributes, such as the number or kind of friends 


a person has, or a person’s tendency to be central in a network, may 
have a partially genetic basis'*"*. 

The evolutionary relevance of social networks is also suggested by 
their role in cooperation. Evolutionary theories of cooperation rely on 
explicit or implicit assumptions regarding social structure’. Direct 
reciprocity assumes that the same individuals will encounter each other 
repeatedly*. Similar conclusions have been reached regarding indirect 
reciprocity’’. Other theoretical models of kin selection, generalized 
assortativity’, group competition® and social networks’ have also 
explicitly recognized the importance of population structure, showing 
that cooperation can evolve if individuals tend to interact with others of 
the same type (cooperators with cooperators and defectors with 
defectors). If real-world interactions do not exhibit such assortativity, 
then none of these theories can explain the widespread cooperation in 
humans that we observe today. 

To discover the possibly adaptive origins of human social networks, 
and their relationship to cooperation, we wanted to examine network 
features in an evolutionarily relevant setting, that is, in a population 
whose way of life is thought to resemble that of our early ancestors’. 
Although cooperation is widespread in human societies, modern hunter- 
gatherers possibly exemplify this feature best—extensively sharing 
food, labour and childcare. It is likely that the high levels of cooperation 
observed in modern hunter-gatherers were also present in early 
humans'’®. Thus far, little work has focused on networks in hunter- 
gatherers. Related topics have included estimation of the total size of 
hierarchical social units’’, examination of the role of resource produc- 
tion in social organization and residence patterns’’, and evaluation of 
food sharing’’. We know of no study that has attempted to map the 
complete social network of a foraging population to study its dyadic 
building blocks (ties between pairs of people) and macroscopic struc- 
ture, as well as the role of cooperation. 

Therefore, we performed a comprehensive, socio-centric network 
study of the Hadza hunter-gatherers of Tanzania. Connections 
between individuals were identified in two ways: subjects were asked 
with whom they would like to live in the next camp (the “campmate 
network’), and to whom they would give an actual gift of honey (the 
“gift network’) (see Methods). We studied 205 individuals, and there 
were 1,263 campmate ties and 426 gift ties. By measuring a com- 
prehensive set of statistics, we evaluated whether Hadza networks 
differ quantitatively from random networks in the same ways that 
modernized networks do. 

Cumulative distributions of in-degree (the number of times an indi- 
vidual is nominated) are shown in Fig. la. As is typical of networks’, 
the degree distributions have significantly fatter tails than a similarly 
sized group composed of individuals randomly forming the same 
number of social ties (Kolmogorov-Smirnov test, P< 10° for all 
comparisons). Degree distributions for the male and female campmate 
networks did not differ (Kolmogorov-Smirnov test, P = 0.86 for in- 
degree and 0.59 for out-degree). 

As in modernized societies”’, we find that the probability of a social 
tie decreases with increased geographic distance (see Supplementary 
Fig. 6a). Of significance to kin selection theory, we also find that the 
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Figure 1 | Structural features of modern social networks also exist in Hadza 
networks. a, Cumulative in-degree distributions show the fraction of the 
population that has at least k social ties. The distributions for the campmate and 
gift networks are significantly different from random networks with the same 
number of nodes and edges (Kolmogorov-Smirnov test, P< 10 '°) and have 
fatter tails; the random distributions are shown separately for campmate and 
gift networks (in grey). The gift networks within each camp (ordered by size of 
camp from smallest, yellow, to largest, blue) show similar distributions of in- 
degree. b, Estimates based on dyadic models of social ties (see Supplementary 
Information) show that a 1-s.d. change in similarity in characteristics between 
two people significantly increases the likelihood of a social tie (homophily). 


probability that two individuals are connected increases as the genetic 
relatedness between the pair increases, in both the campmate and the 
gift networks (see Supplementary Fig. 6b). 

We used regression analysis to evaluate the relationship between 
personal characteristics and degree (see Supplementary Information). 
In both the campmate and gift networks, age, height, weight and marital 
status are positively and significantly related to both out-degree and 
in-degree (see Supplementary Fig. 7a, b). For example, in the gift 
network, an 8.7-cm (one standard deviation, 1 s.d.) increase in height 
is associated with a 125% increase in out-degree and a 173% increase in 
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Horizontal lines indicate 95% confidence intervals. For the campmate 
networks, sex is not included because all ties are same-sex; homophily for height 
is not shown because it does not fit on the scale (the estimate is 801%, 95% 
confidence interval 549-1,148%), and homophily for cooperation is shown in 
Fig. 2c. c, Graphs of the campmate networks show that cooperators tend to be 
connected to cooperators and cluster together (see also Fig. 2b). Node colour 
and size indicates donation, shape indicates sex. Arrows point from an ego (the 
naming person) to an alter (the named person). Arrow colours indicate 
whether the ego and alter are related genetically, affinally (by marriage) or not 
at all (friendship). 


in-degree, suggesting that taller people are both more socially active and 
more socially attractive. The significance of these associations survives 
when we add numerous controls to the models, including camp-level 
fixed effects, geographic distance, genetic and affinal relationships, 
spouse relationships, age and sex (see Supplementary Information). 
Other characteristics associated with degree in at least one of the models 
include body fat, muscle mass, handgrip strength, the value placed on 
meat, and reproductive success, but none of these survive controls in 
both the campmate and gift networks, except body fat for in-degree and 
handgrip strength for out-degree (see Supplementary Information). 
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The selection of physically fit reproductive partners (in both 
traditional and modernized societies) makes sense from an evolutionary 
perspective, given the gains in resources and genetic benefits that can be 
passed on to offspring. But hunter-gatherers also prefer to form connec- 
tions to non-reproductive partners who are physically fit, suggesting that 
this tendency might also be both common and ancient in origin. Food 
acquisition and processing in foragers is labour and time-intensive, 
requiring strength and stamina, as well as skill and knowledge*". Thus, 
forming connections with physically fit individuals probably translates 
into increased resources. 

Hadza networks also resemble modernized human networks insofar 
as they too differ from random networks with respect to reciprocity'°”. 
An ‘ego’ (the naming person) is 44.2 times (95% confidence interval 
37.6-51.4) more likely to name an ‘alter’ (the named person) in the 
campmate network, and 14.3 times (95% confidence interval 12.2- 
16.4) more likely to name an alter in the gift network, if the alter 
reciprocated the social tie by also naming the ego as a friend (indeed, 
this happens even though nominations are private). Reciprocity 
remains significant even when controlling for genetic and non-genetic 
family ties, suggesting that reciprocity also exists between unrelated 
individuals. 

Hadza networks also evince degree assortativity. People with higher 
in-degree name more social contacts, and people with higher out- 
degree are more likely to be named (see Supplementary Fig. 7a, b), 
even in models with controls (including a control for reciprocity). In 
other words, individuals who nominate more friends are popular even 
among those they themselves did not nominate. 

Yet another property Hadza networks have in common with 
modernized human networks is that they have higher transitivity than 
expected in random networks”. In the campmate networks, transitivity 
is 0.17 for females and 0.16 for males, while in the gift networks, the 
average transitivity is 0.41 (see Supplementary Fig. 7c). By comparison, 
in random networks with the same number of nodes and edges, transi- 
tivity is always less than 0.01. 

Turning to homophily (Fig. 1b), in both the campmate and gift 
networks, social ties are significantly more likely when two people 
are similar in age, height, weight, body fat and handgrip strength. 
Thus, as in other human networks, hunter-gatherers who are socially 
connected tend to resemble one another. For example, a 7.5-kg (1 s.d.) 
increase in the similarity of weight is associated with a tripling of the 
probability (+201%) that two people are connected. There is also 
evidence of homophily on marital status, muscle mass, and the value 
placed on meat and baobab, but these relationships do not survive in 
the models with controls, except for the value of meat in the campmate 
network (see Supplementary Information). 

Hunter-gatherer life is characterized by imbalances in productivity 
and consumption (for example, owing to differences in strength, 
which varies across an individual’s lifetime), and this is reflected in 
divisions of labour*. Thus, one might expect that choices of 
campmates would reflect complementarity (heterophily) rather than 
homophily. On the other hand, homophily may facilitate collective 
action because similar individuals are more likely to share assump- 
tions, experiences and goals”, and also because similarity increases 
empathy, which in turn facilitates cooperation’*. We find no signifi- 
cant heterophily on any attribute examined. 

Wealso directly compared the measured Hadza parameters to values 
for 142 sociocentric networks of adolescent students in the USA and to 
two sociocentric networks of adult villagers in Honduras. The Hadza 
parameters fall within the observed ranges in these other networks, 
often near the centre of the distribution (see Supplementary Informa- 
tion). However, comparison of the precise values is limited by, among 
other things, variation in how ties were ascertained. Further research 
will be needed to clarify how details of human social network structure 
might vary across settings, if at all. 

Graphs of the Hadza social networks (Fig. 1c) show that they tend to 
be structured in a way that is relevant for cooperative behaviour, as 
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elicited in public goods games. In particular, there is homophily on 
cooperation: cooperators tend to be connected to other cooperators, 
and non-cooperators to non-cooperators. 

Although natural selection is said to favour defection in unstructured 
populations where all individuals have an equal chance of interacting 
with one another, cooperation can evolve if population structure 
permits clustering®. This feature allows cooperators to increase in the 
population because they benefit from the public goods provided by 
fellow cooperators with whom they interact. A key prediction of some 
evolutionary models is thus that there should be relatively more variance 
in cooperative behaviour between groups as compared to within groups®. 
But it was not known whether such assortment in cooperative behaviour 
actually exists in populations thought to resemble our early human 
ancestors. 

In Fig. 2a, we show a comparison of the observed variance in 
donations to the public good to the variance obtained when we keep 
the population structure fixed and randomly reshuffle the observed 
distribution of donations across all individuals. Compared to chance, 
there is significantly more between-camp variation (P = 0.01) and 
significantly less within-camp variation (P= 0.01) in cooperative 
behaviour. 
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Figure 2 | Donations in the public goods game are associated with social 
network characteristics. a, A comparison of variance in observed donations 
with variance in 1,000 simulations in which donations were randomly shuffled 
between all individuals in the population shows that between-group variance in 
cooperation is significantly higher than expected, and within-group variance is 
significantly lower than expected, at the camp level. b, An analysis of 
cooperative behaviour across all camps shows that correlation in cooperation 
extends to one degree of separation in the campmate networks and two degrees 
(to one’s friend’s friends) in the gift networks. Moreover, there is anti- 
correlation at three degrees of separation in the campmate network, suggesting 
polarization between cooperators and non-cooperators. ¢c, Correlation in 
behaviour cannot be explained by cooperators being more likely to form or 
attract social ties. Instead, subjects with similar levels of giving are significantly 
more likely to be connected at the dyadic level. d, Finally, several measures of 
proximity are independently associated with similarity in donations, but social 
proximity (the inverse of the degree of separation between two people in the 
network) appears to be just as important as genetic proximity (relatedness) and 
physical proximity (residence in the same camp) in a multivariate test. (Gift 
networks are defined only within camps and so are not presented for ‘camp’ and 
‘geographic’ proximity in Fig. 2d.) Vertical lines indicate 95% confidence 
intervals and asterisks indicate estimates with P < 0.05. See the Supplementary 
Information for details of the models. 
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We investigated the role of network connections with respect to 
group-level variation in cooperation by studying the tendency of 
cooperative individuals to be connected to other cooperators. We 
regressed public-good donations on the donations ofa person’s friends 
(see Supplementary Information). Each extra stick of honey donated is 
associated with an extra 0.13 sticks (95% confidence interval 0.05- 
0.21) donated by each friend in the campmate networks and an extra 
0.21 sticks (95% confidence interval 0.10-0.32) donated by each friend 
in the gift networks. Moreover, in the gift networks, the association 
extends to two degrees of separation; each friend’s friend donates an 
extra 0.15 sticks (95% confidence interval 0.07-0.25) for every stick a 
person donates (Fig. 2b). And, interestingly, at three degrees of sepa- 
ration, there is significant anti-correlation in the campmate networks 
(-0.04 sticks, 95% confidence interval —0.00 to -0.06), suggesting that 
cooperative and non-cooperative clusters tend to be polarized (though 
this might also reflect a finite-size effect, given the small size of Hadza 
society). 

Dyadic analyses of social ties show that people who donate more do 
not have higher out-degree or in-degree (Fig. 2c). In fact, in the gift 
networks, there is a weakly significant negative association between 
donations and in-degree, though this relationship does not survive in 
the models with controls (see Supplementary Information). This 
suggests that we can reject the hypothesis that hunter-gatherers 
unconditionally prefer to form ties with cooperators. However, there 
is significant homophily on cooperation in both the campmate and gift 
networks, and the relationship survives in the model with controls for 
the campmate network: cooperators are preferentially connected to 
other cooperators. 

To determine whether social network structure may help to explain 
variation in cooperative behaviour, we considered three different kinds 
of proximity that could be generating the similarity: geographic, 
genetic and social. If the physical environment is an important source 
of variation, then geographic proximity should help to predict similarity 
in cooperative behaviour. Additionally, people who live in the same 
camp should be more similar than those who do not. If genes are an 
important source of variation, then genetic proximity (measured as 
relatedness) should help to predict similarity in cooperative behaviour. 
But if social networks are a source of variation, then social proximity 
(measured separately for the campmate networks and gift networks by 
the inverse of the degrees of separation between two people) should help 
to predict similarity in cooperation. In separate regression models, each 
kind of proximity is significantly related to similarity in cooperation 
(see Supplementary Information). In contrast, age and sex similarity are 
not significant predictors. However, when we include all the proximity 
measures in one model, geographic proximity ceases to matter (Fig. 2d). 
Moreover, social proximity, as measured in both the campmate net- 
works and the gift networks, appears to be just as important as genetic 
proximity and camp co-residence, suggesting that cooperative beha- 
viour may be best understood as a process influenced by a combination 
of not only genes and environment, but also social networks. 

The Hadza represent possibly one of the most extreme departures 
from life in industrialized societies, and they remain relatively isolated 
from modern cultural influences. Yet all the examined properties of 
social networks seen in modernized societies also appear in the Hadza. 
Compared with random networks, Hadza networks, like modernized 
networks, exhibit a characteristic degree distribution, greater degree 
assortativity, transitivity, reciprocity and homophily than would be 
expected from chance, and a decay with geographic distance. 

To the extent that the Hadza represent our late Pleistocene ancestors*, 
the network properties and social preferences in the Hadza may indeed 
reflect elements of human sociality along with which high levels of 
human cooperation evolved. Whether certain aspects of human social 
network structure existed still further back in our hominid past is 
unclear. Evaluating the resemblances between non-human and human 
primate networks is difficult, in part because the qualitative nature of 
dyadic ties can vary considerably across species*®. Nevertheless, some 
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network properties may be quite old. For instance, age and sex predict 
both the quantity and quality of many primate interactions, and 
primate networks may demonstrate homophily’®. Possibly, certain 
aspects of social network structure might appear in any vertebrate 
species that forms social networks”, because particular structural 
features might facilitate the solution of problems common to such 
species (for example, those related to coordinated action, infection 
resistance and information transmission). 

Humans’ ability to trace descent bilaterally and form strong 
relationships with both sets of kin not only maximizes their kin ties 
but also increases their ability to move freely; once an organism is able 
to recognize paternal kin, potential inbreeding can be avoided without 
the need for evolution to favour a sex-biased dispersal pattern. 
Whereas chimpanzee females disperse and males typically spend their 
lives in their natal community, hunter-gatherers of both sexes can stay 
in or leave their natal group"*, with individuals changing camp member- 
ship throughout their lives. It is thus possible that relaxed constraints on 
social mobility patterns provided humans a greater opportunity to make 
friends, which in turn allowed cooperators more opportunities to form 
ties with other cooperators and break ties with defectors. 

Although the Hadza have a preference for kin as both campmates and 
gift recipients (indicating a potential for kin selection), the Hadza also 
actively form many ties with non-kin. In fact, recent work examining co- 
residence patterns across hunter-gatherer societies suggests that first- 
order relatives make up less than 10% of residential camps’’, raising the 
question of how high levels of cooperation are maintained in groups of 
mostly unrelated individuals. The pervasive sharing of food that char- 
acterizes hunter-gatherer life is one plausible evolutionary mechanism”, 
but theories of kin selection and reciprocal altruism, used to explain food 
sharing, have been criticized on the grounds that they require producer 
control over resource distribution”*. On the other hand, regardless of 
whether foragers have producer rights, they do maintain flexibility in 
choosing their friends and campmates, thus providing some control over 
resource distribution. 

In summary, Hadza networks are structured in a way that is con- 
sistent with the evolution of cooperative behaviour. Cooperators tend 
to be connected to cooperators at both the dyadic and network level, 
conditions necessary to sustain cooperation”*. This phenomenon cannot 
be explained by camp-level differences in the contextual environment 
because it persists in a model that controls for camp-level fixed effects. 
However, it might be explained by two alternative hypotheses. One is 
that cooperators tend to form ties preferentially with other cooperators, 
leaving defectors no choice but to form ties to the remaining non- 
cooperators”’. Another is that people may influence the cooperative 
behaviour of their networks, as demonstrated in experimental studies*’. 
But regardless of the causal mechanism, homophily on cooperation and 
selective formation of network ties create conditions that would make it 
easier for cooperative behaviour to evolve”. This suggests that social 
networks may have co-evolved with the widespread cooperation in 
humans that we observe today. 


METHODS SUMMARY 


We surveyed 205 adults in 17 Hadza camps. Cooperation was elicited by examining 
subjects’ contributions to a public good using sticks of honey. Both women and men 
donated slightly more than half of their endowment. 

We collected network data at both the population level and the camp level. We 
discerned same-sex network ties across the entire Hadza population by asking 
each individual: “With whom would you like to live after this camp ends?” We call 
this the ‘campmate network’. On average, women chose 6.0 (+1.9 s.d.) campmates 
and men chose 7.1 (+2.1s.d.) campmates. To facilitate this, we used posters con- 
taining facial photographs of a census of 517 adult Hadza (see Supplementary 
Information). 

We discerned network ties in an additional way. Every adult in each camp (100%) 
was given three sticks of honey, which they could anonymously distribute to other 
adults, of either sex, in their camp. Participants could give all the honey to one 
person or distribute it to up to three different people. We call this the ‘gift network’. 
On average, both women and men chose to give to 2.2 (+0.8 s.d.) recipients. 


©2012 Macmillan Publishers Limited. All rights reserved 


Anthropometry measures were also collected, as well as marital status, repro- 
ductive histories, and many other measurements, and we computed the genetic 
relatedness of all pairs of people. The data were analysed with regression models 
and other methods (see Supplementary Information). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Over a two-month period during the summer of 2010, we surveyed 205 adults (18- 
65 years old, 103 women and 102 men) by visiting 17 distinct Hadza camps within 
approximately 3,825 square kilometres (see map in Supplementary Information). 

Cooperation was elicited by examining all subjects’ voluntary contributions to a 
public good within each camp. Subjects were endowed with four sticks of honey, 
the favourite food of the Hadza”', and they then faced the decision of how to divide 
their endowment into a private account and a public account where the goods 
would be distributed evenly with all other adult camp members who also played 
the game (with N substantially greater than 4). Subjects were instructed that they 
could give any amount to the public good or keep any amount for themselves, and 
their choices were secret (see Supplementary Information). They were told that the 
researcher would triple any amount contributed to the public good, and that, only 
after all adult camp members played the game, they would receive both the honey 
in their private account as well as their share of honey from the aggregate con- 
tributions to the public good. Both women (mean = 2.32, standard error = 0.12) 
and men (mean = 2.22; standard error = 0.12) donated slightly more than half of 
their endowment. This is the first public-good experiment conducted with the 
Hadza, and, similarly to other small-scale societies, we find that Hadza behaviour 
is not governed by pure self-interest”. 

We collected network data at both the population level and the camp level using 
a different ‘name generator’ for each*’. The Hadza live in camps consisting of 
approximately 30 individuals (including both adults and children)*; we include 
only adults here (an average of 11.7 (+6.0s.d.) adults live in each camp). Camps 
shift location every 4-6 weeks and individual membership in camps is fluid’. 
Because food that is brought into camp is widely shared”, and because sharing 
of food is thought to be one of the main benefits of group living owing to the high 
variance in individual food returns*’, choosing with whom to live is consequential. 
Hence, we discerned same-sex network ties across the entire Hadza population by 
asking each individual: “With whom would you like to live after this camp ends?” 
We call this the ‘campmate network’. The participants were instructed that they 
could choose up to a maximum of ten individuals, either currently living in their 
camp or outside their camp. On average, women chose 6.0 (+1.9 s.d.) campmates 
and men chose 7.1 (+2.1s.d.) campmates (and 3% of women and 21% of men 
chose the maximum of 10). To facilitate this process and eliminate potential data 


collection errors, we used posters containing full-frontal facial photographs of a 
census of 517 adult Hadza (see Supplementary Information). 

At the camp level, we discerned network ties in an additional, novel way. Every 
adult in each camp (100%) was given three sticks of honey, which they could 
anonymously distribute to other adults, of either sex, currently living in their 
camp. Participants were told that they could give all the sticks of honey to one 
person or distribute them to up to three different people, but, unlike in the pro- 
cedures used to elicit cooperation (which were administered separately), they 
could not keep the sticks for themselves. We call this the ‘gift network’. The 
network discerned this way yields a maximum out-degree of 3, but it is also a 
weighted network, because people could choose to give more honey to certain 
alters. On average, both women and men chose to give to 2.2 (+0.8 s.d.) recipients. 
While, ordinarily, gifts among the Hadza are not anonymous and involve repu- 
tation effects, the Hadza do understand the importance and feasibility of a gift 
given anonymously. We elected to measure ties this way because it was objective 
and could characterize phenomena such as reciprocity with limited bias. 

Finally, anthropometry measures (including height, weight, body fat percentage, 
muscle mass and hand-grip strength) were collected, as well as marital status, 
reproductive histories, and other measurements; we also computed the genetic 
relatedness of all pairs of people (see Supplementary Information). The data were 
analysed with GEE regression models***’, and confirmatory analyses were also 
done using other methods (see Supplementary Information). 
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Purkinje neuron synchrony elicits time-locked 
spiking in the cerebellar nuclei 


Abigail L. Person'+ & Indira M. Raman! 


An unusual feature of the cerebellar cortex is that its output neurons, 
Purkinje cells, release GABA (y-aminobutyric acid). Their high 
intrinsic firing rates’ (50 Hz) and extensive convergence”’ predict 
that their target neurons in the cerebellar nuclei would be largely 
inhibited unless Purkinje cells pause their spiking, yet Purkinje and 
nuclear neuron firing rates do not always vary inversely*. One indica- 
tion of how these synapses transmit information is that populations 
of Purkinje neurons synchronize their spikes during cerebellar beha- 
viours*’''. If nuclear neurons respond to Purkinje synchrony, they 
may encode signals from subsets of inhibitory inputs”’” *. Here we 
show in weanling and adult mice that nuclear neurons transmit the 
timing of synchronous Purkinje afferent spikes, owing to modest 
Purkinje-to-nuclear convergence ratios (~40:1), fast inhibitory 
postsynaptic current kinetics (Zgecay = 2.5 ms) and high intrinsic 
firing rates (~90 Hz). In vitro, dynamically clamped asynchronous 
inhibitory postsynaptic potentials mimicking Purkinje afferents 
suppress nuclear cell spiking, whereas synchronous inhibitory post- 
synaptic potentials entrain nuclear cell spiking. With partial syn- 
chrony, nuclear neurons time-lock their spikes to the synchronous 
subpopulation of inputs, even when only 2 out of 40 afferents syn- 
chronize. In vivo, nuclear neurons reliably phase-lock to regular 
trains of molecular layer stimulation. Thus, cerebellar nuclear 
neurons can preferentially relay the spike timing of synchronized 
Purkinje cells to downstream premotor areas. 

During cerebellar behaviours, on-beam Purkinje cells fire synchronous 
simple spikes with millisecond-scale precision®*, and microbands of 
Purkinje cells fire synchronous complex spikes’. The extent to which 
cerebellar nuclear neurons respond to synchrony depends on the mag- 
nitude, convergence and kinetics of Purkinje inputs. We therefore mea- 
sured these properties before assessing the consequence of Purkinje 
synchrony on nuclear firing in vitro and in vivo. 

We first recorded IPSC amplitudes evoked in nuclear neurons by 
stimulating Purkinje axons in mouse cerebellar slices (Fig. la, b). 
Maximal stimulation evoked inhibitory postsynaptic currents (IPSCs) 
up to ~200nS (mean +s.e.m., 108.3 + 13.0nS; n=17). Minimal 
stimulation evoked either failures or successes of 9.44 1.0nS 
(n = 30), the amplitudes of which resembled tetrodotoxin-sensitive 
spontaneous IPSCs’°, consistent with activation of a single fibre. The 
high unitary amplitude suggests that individual Purkinje-to-nuclear 
contacts are quite strong’®. The maximal-to-minimal conductance ratio 
indicates that functional convergence in the slice is about 12, or 
maximally ~22. Because both slicing and incomplete recruitment of 
afferents probably limit maximal IPSC amplitudes, this estimate 
represents a lower bound on functional convergence in the intact 
cerebellar nuclei. 

We obtained a second convergence estimate by considering struc- 
tural constraints on innervation (see Supplementary Discussion for 
calculations). In mouse, the Purkinje-innervated surface of a large 
nuclear cell with four dendrites (3,130 m7) maximally accommodates 
1,250 Purkinje boutons, each with a 2.5 uum? surface area; comparable 
numbers (600-1,200) are estimated for cat'’. Multiple boutons must 


arise from a single Purkinje cell, however, because the quantal content 
of a unitary Purkinje IPSC'*'® is 12-18. Given a boutonal release 
probability’® of 0.5, each Purkinje cell must contribute ~24-36 
boutons per target, predicting 34-52 Purkinje cells per nuclear cell. 
A third convergence estimate was derived from the maximal GABA, 
receptor conductance per nuclear neuron”, which yielded a mean con- 
vergence of ~20 (maximum, ~30) (Supplementary Discussion). The 
estimates from these three separate measures can be reconciled with 
classical histological measurements of 860 Purkinje cells per nuclear cell 
in the cat’, by the observation that each Purkinje bouton has 9-10 
synaptic densities*’*. Correcting for this factor reduces the predicted 
convergence to 86-96, on the order of magnitude of the present 
estimates. Thus, multiple approaches indicate a functional conver- 
gence in the intact cerebellum of tens of Purkinje neurons per nuclear 


neuron, probably 20-50 in the rodent. 
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Figure 1 | Purkinje cell convergence and fast IPSCs in the cerebellar nuclei. 
a, IPSCs evoked by maximal stimulation. Holding potential, —57 mV (junction 
potential corrected). Top: five overlaid IPSCs. Middle: IPSC amplitudes as 
stimulus strength was reduced. Bottom: maximal conductances calculated from 
measured IPSC reversal potentials (n = 17, mean Ec) = —75 mV). Bin width, 
10 nS. CbN, cerebellar nuclei; Pkj, Purkinje. b, IPSCs evoked by minimal 
stimulation (range 16-20 1A). Top: 20 overlaid IPSCs. Middle: expansion of 
box in a. Events near the zero line fell within the noise. Bottom: minimal 
conductances (n = 30). Bin width, 2 nS. c, Left: scaled IPSCs in one cell at 22 °C 
and 36 °C. Right: temperature dependence of IPSC decay t (n = 7). d, Top: 50- 
Hz train of IPSCs at 36 °C. Bottom: depression of phasic IPSCs and absence of 
tonic current during train. Data are mean + s.e.m. 
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Additional determinants of the efficacy of inhibition are the 
intrinsic firing rates and IPSC kinetics. In 17-26-day-old mice (P17- 
P26), nuclear neurons fired spontaneously at 91.5 + 7.4 spikes gt 
significantly faster than in P13 animals recorded under identical con- 
ditions'> (22.0 + 5.0 spikess '; Supplementary Fig. 1), indicating a 
developmental increase in intrinsic firing rate. At near-physiological 
temperature (36°C), the Tdecay of evoked IPSCs was 2.4 + 0.2 ms 
(Fig. 1c), making corticonuclear IPSCs among the fastest known 
GABA ,-receptor-mediated currents”. This value is >5-fold briefer 
than the Tyecay of 13.6ms of IPSCs at room temperature”’, which 
appears in previous corticonuclear models'**. When unitary IPSCs 
were mimicked with dynamic clamp, the duration of inhibition of 
spontaneously firing nuclear cells varied directly with tgecay'*"™, 
and the first post-inhibitory spike was well-timed relative to the stimu- 
us’*** (Supplementary Fig. 2). Thus, brief IPSCs minimize inhibition. 

When synchronous IPSCs were evoked at 50 Hz at 36 °C, peak IPSCs 
were depressed by 55% (n = 6), but the current decayed fully between 
stimuli (Fig. 1d), resulting in no accumulation of ‘tonic’ current’>. 
Because tonic current is largely responsible for suppressing nuclear cell 
spiking, synchronous and asynchronous IPSCs may differentially 
modulate nuclear cell output. As a preliminary test of this idea, we 
delivered trains of dynamically clamped inhibitory postsynaptic poten- 
tials (dynIPSPs) from eight afferents, each with an amplitude of 10 nS 
and a rate of 50 Hz, which were either desynchronized (equivalent to 
one non-depressing input at 400 Hz) or synchronized (Fig. 2a). Not 
surprisingly, desynchronized dynIPSPs strongly inhibited spontaneous 
firing. In contrast, upon synchronization of dynIPSPs, nuclear neuron 
firing rates jumped instantaneously, but only when Tgecay was brief 
(Fig. 2b). Moreover, with physiological taecay, nuclear cell spiking 
entrained to the synchronous input, evident as a match between the 
instantaneous firing rate and the stimulus rate (Fig. 2a). Likewise, IPSPs 
evoked by Purkinje stimulation nearly perfectly entrained nuclear neuron 
spiking between 60 and 80 Hz (Fig. 2c, d); in contrast, desynchronized 
dynIPSPs suppressed firing at all stimulus rates (Fig. 2d). Thus, in the 
extreme case of low convergence, the output of nuclear neurons is deter- 
mined by both the rate and synchrony of inhibition. 
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Figure 2 | Entrainment of nuclear neuron spiking to synchronous IPSPs. 
a, Top: response to desynchronized and synchronized dynIPSPs from 8 inputs 
at 50 Hz, Tdecay = 2.5 ms. Bottom: instantaneous firing rates of neuron in 

a, showing entrainment during synchrony. Arrows indicate onset of 
desynchronized (left) and synchronized (right) dynIPSPs. Tall spikes reflect 
overlapping dynIPSPs and spikes. b, Firing rates during desynchronized and 
synchronized 50-Hz trains with different dynIPSP kinetics (Tdecay = 2.5, 5; 

14 ms, n = 25, 18, 10). c, Entrainment to corticonuclear IPSPs (36 °C) between 
60 and 80 Hz. Arrowheads indicate stimulation times. d, Median firing rates 
(+1 quartile) in response to evoked IPSPs (closed symbols) and 
desynchronized dynIPSPs (open symbols). From 50-100 Hz, evoked, n = 8, 8, 
8, 7, 3; simulated, n = 25, 10, 10, 9, 14. Dashed line indicates unity. Data are 
mean = s.e.m., unless noted. 
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Synchronous Purkinje firing occurs in vivo’, but the proportion of 
convergent Purkinje cells that synchronize is probably <100%. We 
therefore mimicked 40 Purkinje afferents with dynamic clamp, reflecting 
a realistic functional convergence, and tested whether synchronizing 
only a subpopulation of inputs can affect cerebellar nuclear output. To 
account for synaptic depression and mimic irregular spiking in vivo”, 
each input was 5 nS, and inter-IPSP intervals were drawn from Gaussian 
distributions (coefficient of variation (CV) = 1), yielding a mean of 50 
IPSPss ‘. After 1s of asynchronous inhibition, a fixed number of 
inputs (2-20, or 5-50%) was synchronized at 20-125 Hz, while the 
remaining inputs continued to provide asynchronous inhibition, each 
at 50IPSPss | (Fig. 3a, b). Asynchronous dynIPSPs greatly slowed 
nuclear cell firing. Firing rates, however, increased with fractional syn- 
chrony (Fig. 3a, c). The averaged voltage trajectories of synchronous 
dynIPSPs preceding and lacking a spike did not differ in the depth of 
hyperpolarization (Supplementary Fig. 3). Thus, the action potentials 
are not low-threshold spikes that follow unusually large inhibitory 
stimuli. 

During trains of synchronous dynIPSPs, distinct temporal patterns 
of spiking emerged. For all frequencies tested, interspike interval dis- 
tributions clustered at multiples of the interstimulus intervals of the 
synchronous dynIPSPs (Fig. 3d), reflecting phase locking of spikes to 
the regular stimulus train. The data were combined across stimulus 
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Figure 3 | The synchronous subpopulation of Purkinje inputs sets spike 
timing of nuclear neurons. a, Nuclear neuron responses to dynIPSPs from 40 
asynchronous or partially synchronized inputs, as labelled. b, Spike rasters 
during dynIPSP trains with 50-5% synchrony. ¢, Firing rates during 20-125-Hz 
trains with 0-50% synchrony (‘Synchronized rates’). d, Normalized interspike 
interval distributions during 50% synchrony, from 40 to 100 Hz. Abscissa tick 
marks indicate multiples of the interstimulus intervals. Bin width, 2 ms. Black, 
no current injection; blue, with 200 pA direct current (DC) applied to increase 
spike number during inhibition. e, Black: polar histograms of interspike 
intervals during 50-5% synchrony for 50, 55, 67, 83, 100 and 125 Hz input. 
Each cycle is one interstimulus interval (StimI). Red: net vectors of interspike 
interval histograms. (For 50%, 25%, 10%, 5% synchrony, n = 14, 9, 7, 7.) Data 
are mean + s.e.m. 
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frequency by plotting interspike intervals as normalized polar histo- 
grams, with 0° representing multiples of the interstimulus interval 
(Fig. 3e). A net vector magnitude of 1 indicates perfect phase locking, 
and 0 indicates randomly occurring spikes. For all fractions of syn- 
chrony, the net vector was oriented close to 0° (2° + 2°) with a non-zero 
magnitude (0.18, 0.25, 0.45 and 0.52 for 5%, 10%, 25% and 50% syn- 
chrony), illustrating the temporal restructuring of spike patterns accord- 
ing to stimulus interval. 

To test whether conditions in vivo permit phase locking, we recorded 
single units from the cerebellar nuclei in anaesthetized adult mice 
(Fig. 4a). Purkinje spiking was partially synchronized by stimulating 
the molecular layer” . Indeed, for stimuli from 20-120 Hz, phase lock- 
ing was robust in all neurons confirmed histologically to be within the 
nuclei (10 cells in 10 mice; Fig. 4a, b). No phase locking was apparent in 
two neurons located outside the nuclei. The net vector of the mean polar 
histogram combined across frequencies was oriented at —3° (Fig. 4c). 
Its magnitude (0.33) was between those for 10% and 25% synchrony in 
vitro, and was greater than that of baseline (0.09, P< 0.05, within-cell 
comparisons of vector magnitudes; direction 257°). 

Stimulation modulated the mean firing rates of individual neurons 
(P < 0.05 versus baseline, within-cell comparisons). Like the results for 
different fractional synchrony in vitro, however, spike rates decreased 
in some cells (n = 7) and increased in others (n = 3; Supplementary 
Fig. 4), yielding no net change across cells (P > 0.05; Fig. 4d). Thus, the 
firing rate of nuclear cells, which is probably affected by both basal 
excitation and the fractional synchrony induced by the stimulus, does 
not obviously encode information about the rate of inhibitory inputs. 


Stimulation: 40 Hz 
80 


To evaluate whether stimulation directly excited nuclear neurons, 
we compared stimulus-to-spike latencies across conditions. In vivo 
latencies were 20.2 + 2.2ms, >5-fold greater than excitatory post- 
synaptic potential (EPSP)-to-spike latencies in vitro (3.5 + 0.5, inhibi- 
tion blocked), making direct excitation seem unlikely. Instead, in vivo 
latencies were closer to IPSP-to-spike latencies (11.9 + 1.4 ms, excita- 
tion blocked) and synchronous dynIPSP-to-spike latencies with back- 
ground inhibition (13.5 + 0.6 ms; Fig. 4e), consistent with sequential 
parallel fibre and Purkinje cell activation*’. Given the similarity of 
phase locking in vivo and in vitro with excitation blocked, these results 
suggest that the in vivo stimulation synchronizes Purkinje cell simple 
spikes, which in turn set the timing of nuclear cell spikes. The reliability 
of phase locking to regular trains of inhibitory stimuli in vivo further 
suggests that the spike patterns of a synchronized Purkinje subpopula- 
tion can be transmitted with fidelity by cerebellar nuclear neurons. 

These experiments address the long-standing question in cerebellar 
physiology of how nuclear neurons encode signals from convergent, 
inhibitory Purkinje cells firing at high basal rates’. The intrinsic tendency 
of nuclear neurons to fire permits time-locked spiking during the 
transient reduction in inhibition after a few brief IPSCs overlap. 
Even synchrony of only two afferents (5%) can influence the timing 
of nuclear cell spiking. The results provide an alternative to the idea 
that nuclear cell firing requires pauses that permit low-threshold 
rebound spikes after prolonged Purkinje activity: not only would such 
a mechanism discard information present in graded Purkinje firing 
rates, but it would also introduce ~100-ms delays in cerebellar proces- 
sing that are not evident”. 
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Figure 4 | Nuclear neurons phase-lock to molecular layer stimulation in 
vivo. a, Upper trace: response of a nuclear neuron to 40-Hz molecular layer 
stimulation (bar). Inset, recording site in the cerebellar nuclei recovered after 
focal Alexa 568 injection. Dashed lines demarcate cerebellar folia. Scale bar, 
200 um. Lower traces: expanded segments of spikes during 40-, 50- and 60-Hz 
stimulation (ticks), showing the long stimulus-to-spike latency. Stimulus 
artefacts are set to zero. Plots show interspike interval histograms for responses 
above. b, Mean normalized interspike interval distributions during molecular 
layer stimulation from 20 to 100 Hz for 4, 9, 9, 7, 5, and 7 cells respectively. 
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Abscissa tick marks indicate multiples of the interstimulus interval. Red 
baseline histogram includes intervals before and after stimulation. Peaks at 
multiples of 8.3 ms were evident in 2 out of 2 cells stimulated at 120 Hz (not 
shown). Bin width, 2 ms. c, Black: polar histograms of interspike intervals 
during stimulation across rates (left) or during baseline periods. Red: net vectors 
of polar histograms. d, Mean firing rates before (Pre, 1 s), during (Stim., 3 s) and 
after (Post) stimulation. e, Stimulus-to-spike latencies for pharmacologically 
isolated EPSPs (black), IPSPs (blue) and dynIPSPs during partial synchrony 
(green) and in vivo (red). Data are mean + s.e.m. 
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The extent to which time locking occurs during natural cerebellar 
processing will depend on the fractional synchrony of converging 
Purkinje cells, the precision of this synchrony, and the variance of 
ongoing excitation of nuclear neurons'’. Supporting the idea that syn- 
chronized Purkinje neurons converge, simple spike synchrony is 
observed in neighbouring on-beam Purkinje cells*®, and closely spaced 
Purkinje cells probably converge on a common nuclear neuron’®. 
Moreover, neighbouring Purkinje cells appear to be functionally 
related, as trans-synaptic tracing from muscles labels discrete strips 
of Purkinje cells”. 

A remaining question is how time-locked nuclear cell spikes are 
processed downstream. In the red nucleus, each interpositus axon 
makes parasagittally ramifying contacts”, and about 50 interpositus 
neurons converge onto each target, producing EPSPs with little short- 
term plasticity”. Therefore, summation of EPSPs in the red nuclei may 
drive spiking that follows the firing pattern of the synchronized Purkinje 
subpopulation. More generally, time locking by cerebellar nuclear 
neurons may be a mechanism whereby the rate and timing of signals 
from varying groups of synchronized Purkinje cells are preferentially 
transmitted to premotor areas and other cerebellar targets. 


METHODS SUMMARY 


Procedures conformed to institutional guidelines for animal care and use. 

In vitro electrophysiology. Parasagittal cerebellar slices from both sexes of C57BL/6 
13-29-day-old mice were cut’ in artificial cerebrospinal fluid (ACSF; 35 °C). 
Recordings from large nuclear neurons were made’ at 35-37 °C, unless noted, with 
a Multiclamp 700B amplifier and pClamp acquisition software. IPSCs were evoked 
in 6,7-dinitroquinoxaline-2,3-dione (DNQX; 5 11M), (RS)-3-(2-carboxypiperazin- 
4-yl)-propyl-1-phosphonic acid (CPP; 10 1M) and strychnine (2 1M) by 0.1-ms 
pulses delivered through a concentric bipolar electrode in the white matter sur- 
rounding the cerebellar nuclei. Evoked IPSCs were considered minimal when a 
fixed intensity stimulus elicited responses and failures with approximately equal 
likelihood, and maximal when increasing intensity produced no larger IPSCs. 
Dynamic clamp was implemented with an SM-2 (Cambridge Conductance) and 
a P25M real-time DSP board sampling between 10 and 25 kHz. Simulated IPSCs 
had a rise time and Tdecay of 0.1 and 2.5 ms, 0.5 and 5 ms, or 1 and 14ms, and 
reversed at —75mV. Desynchronized inputs had dynIPSP intervals distributed 
uniformly. Asynchronous dynIPSP intervals were drawn from a Gaussian distri- 
bution, created offline in IgorPro 5.05, with CV = 1. Forty independent trains were 
merged. 

In vivo electrophysiology. Adult C57BL/6 mice were anaesthetized with ketamine 
(80-100 mgkg_') and xylazine (5-10 mgkg '). A glass recording electrode con- 
taining 2 M NaCl was targeted to the cerebellar nuclei. A bipolar stimulating elec- 
trode (~300 um between poles) was placed stereotaxically in the molecular layer, 
flanking the recording electrode. Three-second stimulus trains were delivered once 
every 15s, repeated 4-20 times per frequency. Data were analysed in IgorPro. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Tissue preparation. Parasagittal slices of the cerebellar nuclei were prepared as in 
ref. 15 in accordance with institutional guidelines for animal care and use. P13- 
P29 mice were deeply anaesthetized with isofluorane and transcardially perfused 
with warm (35 °C) ACSF containing (in mM): 123 NaCl, 3.5 KCl, 26 NaHCOs, 
1.25 NaH3PO,, 1.5 CaCl, 1 MgCl,, 10 glucose and equilibrated with 95/5% O,/ 
CO . Mice were then rapidly decapitated and the brains removed into ACSF 
(35 °C). Slices (300 um thick) were cut on a Vibratome (Leica VT 100S) and 
incubated in warmed (35 °C), oxygenated ACSF for at least 1h before recording. 
Electrophysiological recording. Cerebellar slices were transferred to a recording 
chamber perfused continuously with warmed (35-37 °C, unless noted), oxygenated 
ACSF ata flow rate of 2-4 ml min". Slices were visualized with infrared differential 
interference contrast microscopy and recordings were made from large neurons 
(soma diameters >15 tm). Neurons all had fast decay constants (~2.5 ms at 36°C) 
resembling the <5 ms Tyecay of glutamic acid decarboxylase (GAD)-negative large 
neurons at 34 °C rather than the ~10 ms Tdecay of GAD-positive neurons”. They 
are therefore likely to be excitatory projection neurons. Data were filtered at 6 kHz 
and sampled at 10-25 kHz. 

Voltage- and current-clamp recordings were made with a Multiclamp 700B 
amplifier (Molecular Devices) and pClamp 10.0 data acquisition software 
(Molecular Devices). Borosilicate patch pipettes were pulled to tip resistances of 
2-4 MQ and filled with an internal solution containing (mM): 130 K-gluconate, 2 
Na-gluconate, 6 NaCl, 10 HEPES, 2 MgCl, 0.1 or 1 EGTA, 14 Tris-creatine 
phosphate, 4 MgATP, 0.3 Tris-GTP and 10 sucrose. For experiments isolating 
GABA,-receptor-mediated currents, DNQX (5 UM), CPP (10 1M) and strychnine 
(2 1M) were included in the bath solution and currents were evoked by stimulating 
the white matter surrounding the cerebellar nuclei with 0.1-ms current pulses 
(Isoflex Stimulus Isolation Unit; AMPI) delivered through a concentric bipolar 
electrode (FHC). Evoked IPSCs were considered minimal when a fixed amplitude 
stimulus strength elicited IPSCs or failures with approximately equal likelihood, 
indicating that stimulation was at threshold for a single input. 

Dynamic clamp was implemented with a SM-2 system (Cambridge 
Conductance) with a P25M real-time DSP board (Innovative Integration) sampling 
between 10 and 25 kHz. Unitary conductances were set at either 10 or 5 nS and three 
kinetic profiles were used: 0.1-ms rise time and a single exponential Tgecay of 2.5 ms; 
0.5-ms rise time and Tgecay of 5 ms (ref. 15); and 1-ms rise time and tdecay 14 ms 
(ref. 21), with a reversal potential of —75 mV, accounting for a 7-mV measured 
liquid junction potential. In experiments manipulating synchrony between 0% and 
100%, eight 10-nS convergent units were simulated. Desynchronized inputs were 
designed to distribute IPSP times uniformly during the inter-stimulus-interval, 


such that each ‘unit’ was activated at a given rate but offset in time by (inter- 
stimulus-interval/8) milliseconds. Gaussian trains were created offline in 
IgorPro 5.05 (Wavemetrics). Spike times were drawn from a Gaussian distribution 
with a CV of 1. Forty independent trains were merged to represent a population of 
corticonuclear IPSP arrival times and applied to trigger inhibitory conductances. 
In vivo electrophysiology. Adult C57BL/6 mice were anaesthetized with a cocktail 
of 80-100 mg kg” ' ketamine and 5-10 mg kg” ' xylazine intraperitoneally. A small 
craniotomy above the cerebellum was made with a dental drill. A glass recording 
electrode containing 2 M NaCl with 1% Alexa 568 10K dextran amine was targeted 
to the cerebellar nuclei (in mm: 2.6-3.0 posterior to Lambda, 1.0-1.3 lateral, 2.2- 
3.1 depth). A bipolar stimulating electrode (~300 j1m between poles) was placed 
stereotaxically in the cerebellar molecular layer, aligned approximately in the 
coronal plane (on-beam), flanking the recording electrode. Stimulus trains were 
3 s, delivered once every 15s, had pulse durations of 0.1 ms and amplitudes of 40 
(n = 1), 200 (n = 6) or 360 LA (n = 5), after other studies that used 100-200 pA 
(ref. 23). A stimulus of 360 [1A is expected to stimulate a sphere with <0.5 mm 
radius*®. Each stimulus rate was delivered between 4 and 20 times to each unit. 
After each recording, dye was pressure injected to mark the recording site. Only 
one cell was recorded from each mouse. Mice were transcardially perfused with 
physiological saline, brains were collected and fixed in 4% paraformaldehyde. 
Brains were cryoprotected in 30% sucrose, sectioned to 401m on a freezing 
microtome, and visualized on a Leica laser scanning confocal microscope. 

Data analysis. Data were analysed offline in IgorPro 5.05 (Wavemetrics). Statistical 
analysis was done with paired or unpaired two-tailed f-tests as noted in the text, with 
significance taken as P< 0.05. Polar plots were constructed by binning interspike 
intervals with bin size = stimulus interval/12 for each stimulus frequency in each cell 
and averaging first across cells and then across frequencies. On each polar plot, one 
cycle is one interstimulus interval, such that a bar at 0° indicates spikes occurring at 
multiples of the interspike interval. Net vector magnitudes and directions were 
computed by summing the x and y values associated with the angle and radius 
values of the 12 vectors (histogram bars) that compose the mean polar plot. The 
null polar plot for the in vivo data was constructed from the interspike intervals of 
spikes occurring during non-stimulated periods, plotted with the same time bases as 
the stimulated interspike intervals. For statistical comparison of stimulated and null 
interspike intervals in vivo, vector magnitudes were calculated for data binned at 
0.1 ms. 


30. Tehovnik, E. J., Tolias, A. S., Sultan, F., Slocum, W. M. & Logothetis, N. K. Direct and 
indirect activation of cortical neurons by electrical microstimulation. 
J. Neurophysiol. 96, 512-521 (2006). 


©2012 Macmillan Publishers Limited. All rights reserved 


[LER 


doi:10.1038/nature10738 


Clonal evolution in relapsed acute myeloid leukaemia 
revealed by whole-genome sequencing 
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Most patients with acute myeloid leukaemia (AML) die from pro- 
gressive disease after relapse, which is associated with clonal evolu- 
tion at the cytogenetic level'’. To determine the mutational 
spectrum associated with relapse, we sequenced the primary tumour 
and relapse genomes from eight AML patients, and validated 
hundreds of somatic mutations using deep sequencing; this allowed 
us to define clonality and clonal evolution patterns precisely at 
relapse. In addition to discovering novel, recurrently mutated genes 
(for example, WAC, SMC3, DIS3, DDX41 and DAXX) in AML, we 
also found two major clonal evolution patterns during AML relapse: 
(1) the founding clone in the primary tumour gained mutations and 
evolved into the relapse clone, or (2) a subclone of the founding 
clone survived initial therapy, gained additional mutations and 
expanded at relapse. In all cases, chemotherapy failed to eradicate 
the founding clone. The comparison of relapse-specific versus 
primary tumour mutations in all eight cases revealed an increase 
in transversions, probably due to DNA damage caused by cytotoxic 
chemotherapy. These data demonstrate that AML relapse is asso- 
ciated with the addition of new mutations and clonal evolution, 
which is shaped, in part, by the chemotherapy that the patients 
receive to establish and maintain remissions. 

To investigate the genetic changes associated with AML relapse, and 
to determine whether clonal evolution contributes to relapse, we per- 
formed whole-genome sequencing of primary tumour-relapse pairs 
and matched skin samples from eight patients, including unique patient 
identifier (UPN) 933124, whose primary tumour mutations were previ- 
ously reported’. Informed consent explicit for whole-genome sequen- 
cing was obtained for all patients on a protocol approved by the 
Washington University Medical School Institutional Review Board. 
We obtained >25 haploid coverage and >97% diploid coverage for 
each sample (Supplementary Table 1 and Supplementary Information). 
These patients were from five different French-American-British 
haematological subtypes, with elapsed times of 235-961 days between 
initial diagnosis and relapse (Supplementary Table 2a, b). 

Candidate somatic events in the primary tumour and relapse genomes 
were identified*° and selected for hybridization capture-based validation 
using methods described in Supplementary Information. Deep sequen- 
cing of the captured target DNAs from skin (the matched normal tissue), 
primary tumour and relapse tumour specimens® (Supplementary 
Table 3) yielded a median of 590-fold coverage per site. The average 
number of mutations and structural variants was 539 (range 118- 
1,292) per case (Fig. 1a). 

The general approach for relapse analysis is exemplified by the first 
sequenced case (UPN 933124). A total of 413 somatic events from tiers 


1 to 3 were validated (see ref. 7 for tier designations; Supplementary 
Fig. la and Supplementary Tables 4a and 5). Of these, 78 mutations 
were relapse-specific (63 point mutations, 1 dinucleotide mutation, 13 
indels and 1 translocation; relapse-specific criteria described in Sup- 
plementary Information and shown in Supplementary Fig. 1b), 5 point 
mutations were primary-tumoutr-specific, and 330 (317 point mutations 
and 13 indels) were shared between the primary tumour and relapse 
samples (Fig. 1a, b and Supplementary Fig. 2). The skin sample was 
contaminated with leukaemic cells for this case (peripheral white blood 
cell count was 105,000 cells mm? when the skin sample was banked), 
with an estimated tumour content in the skin sample of 29% 
(Supplementary Information). In addition to the ten somatic non- 
synonymous mutations originally reported for the primary tumour 
sample’, we identified one deletion that was not detected in the original 
analysis (DNMT3A L723fs (ref. 8)) and three mis-sense mutations previ- 
ously misclassified as germline events (SMC3 G662C, PDXDC1 E421K 
and TTN E14263K) (Fig. 1b, Table 1 and Supplementary Table 4b). 

A total of 169 tier 1 coding mutations (approximately 21 per case) 
were identified in the eight patients (Table 1 and Supplementary 
Tables 4b and 6), of which 19 were relapse-specific. In addition to 
mutations in known AML genes such as DNMT3A (ref. 8), FLT3 
(ref. 9), NPM1 (ref. 10), IDHI1 (ref. 7), IDH2 (ref. 11), WT1 (ref. 12), 
RUNX]1 (refs 13, 14), PTPRT (ref. 3), PHF6 (ref. 15) and ETV6 (ref. 16) 
in these eight patients, we also discovered novel, recurring mutations 
in WAC, SMC3, DIS3, DDX41 and DAXX using 200 AML cases whose 
exomes were sequenced as part of the Cancer Genome Atlas AML 
project (Table 1, Supplementary Table 4b and Supplementary Fig. 3; 
T.J.L., R.K.W. and The Cancer Genome Atlas working group on AML, 
unpublished data). Details regarding the novel, recurrently mutated 
genes are provided in Table 1, Supplementary Tables 4b and 7 and 
Supplementary Figs 3 and 4. Structural and functional analyses of 
structural variants are presented in the Supplementary Information 
(Supplementary Figs 5-10 and Supplementary Tables 2, 8 and 9). 

The generation of high-depth sequencing data allowed us to quantify 
accurately mutant allele frequencies in all cases, permitting estimation 
of the size of tumour clonal populations in each AML sample. On the 
basis of mutation clustering results, we inferred the identity of four 
clones having distinct sets of mutations (clusters) in the primary 
tumour of AML1/UPN 933124 (Supplementary Information). The 
median mutant allele frequencies in the primary tumour for clusters 
1 to 4 were 46.86%, 24.89%, 16.00% and 2.39%, respectively (Fig. 1b and 
Supplementary Table 5c). Clone 1 is the ‘founding’ clone (that is, the 
other subclones are derived from it), containing the cluster 1 mutations; 
assuming that nearly all of these mutations are heterozygous, they must 
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Figure 1 | Somatic mutations quantified by deep sequencing of capture 
validation targets in eight acute myeloid leukaemia primary tumour and 
relapse pairs. a, Summary of tier 1-3 mutations detected in eight cases (not 
including translocations). All mutations shown were validated using capture 
followed by deep sequencing. Shared mutations are in grey, primary tumour- 
specific mutations in blue and relapse-specific mutations in red. The total 
number of tier 1-3 mutations for each case is shown above the light-grey 
rectangle. b, Mutant allele frequency distribution of validated mutations from 
tier 1-3 in the primary tumour and relapse of case UPN 933124 (left). Mutant 
allele frequencies for five primary-tumour-specific mutations were obtained 
from a 454 deep read-count experiment. Four mutation clusters were identified 
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in the primary tumour, and one was found at relapse. Five low-level mutations 
in both the primary tumour and relapse (including four residing in known copy 
number variable regions) were excluded from the clustering analysis. Non- 
synonymous mutations from genes that are recurrently mutated in AML are 
shown. The change of mutant allele frequencies for mutations from the five 
clusters is shown (right) between the primary tumour and relapse. The orange 
and red lines are superimposed. c, The mutation clusters detected in the 
primary tumour and relapse samples from seven additional AML patients. The 
relationship between clusters in the primary tumour and relapse samples are 
indicated by lines linking them. 


Table 1 | Coding mutations identified in eight primary tumour-relapse pairs 


UPN Total tier 1 mutations Recurrently mutated genes Relapse-specific non-synonymous 
(primary/relapse) in primary tumour somatic mutations 

452198 9/9 DNMT3A, NPM1, FLT3, IDH1 None 
573988 6/8 NPM1, IDH2 STOX2 
804168 22/26 FLT3, WT1, PHF6, FAMS5C, TTC39A SLC25A12, RIPK4, ABCD2 
933124 14/17 DNMT3A, NPM1, FLT3, TTN, SMC3, PTPRT ETV6*, MYO18B*, WAC*+, STK4 
400220 12/13 FLT3, RUNX1, WT1, PLEKHH1 None 
426980 32/35 IDH2, MYO1F, DDX4 GBP4, DCLK1, IDH2*, DCLK1*, ZNF260 
758168 15/19 DNAH9, DIS3, CNTN5, PML-RARA+ ENSG00000180144, DAGLA* 
869586 51/50 RUNX1, WT1, TTN, PHF6, NF1, SUZ12, NCOA7, EED, DAXX, ACSS3, WAC, NUMA1 None 


Tier 1 mutation counts exclude RNA genes. 
*Recurrent mutations occurring in relapse sample. 
+ Translocations were not included in tier 1 mutation counts. 
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be present in virtually all the tumour cells at presentation and at relapse, 
as the variant frequency of these mutations is ~40-50%. Clone 2 (with 
cluster 2 mutations) and clone 3 (with cluster 3 mutations) must be 
derived from clone 1, because virtually all the cells in the sample contain 
the cluster 1 mutations (Fig. 2a). It is likely that a single cell from clone 3 
gained a set of mutations (cluster 4) to form clone 4: these survived 
chemotherapy and evolved to become the dominant clone at relapse. 
We do not know whether any of the cluster 4 mutations conferred 
chemotherapy resistance; although none had translational consequences, 
we cannot rule out a relevant regulatory mutation in this cluster. 
Assuming that all the mutations detected are heterozygous in the 
primary tumour sample (with a malignant cellular content at 93.72% 
for the primary bone marrow sample, see Supplementary Informa- 
tion), we were able to calculate the fraction of total malignant cells in 
each clone. Clone 1 is the founding clone; 12.74% of the tumour cells 
contain only this set of mutations. Clones 2, 3 and 4 evolved from clone 
1. The additional mutations in clones 2 and 3 may have provided a 
growth or survival advantage, as 53.12% and 29.04% of the tumour 
cells belonged to these clones, respectively. Only 5.10% of the tumour 
cells were from clone 4, indicating that it may have arisen last (Fig. 2a). 
However, the relapse clone evolved from clone 4. A single clone 


a Clonal fractions at initial diagnosis 


12.74% 


29.04% 


«HSCs 

wo 5.10% 
> * DNMT3A, NPM1, FLT3, PTPRT, SMC3 

a 

AML1 
Cell type: Mutations: 
® Nerinal @ AML ° Founding (cluster 1) i 
®@ Primary specific (cluster 2) | ® Relapse enriched (cluster 4) 

b 


Model 1 (UPNs 400220, 573988, 804168) 


Chemotherapy 


Model 2 (UPNs 426980, 452198, 758168, 869586, 933124) 


ae 
ee 


Chemotherapy 


Figure 2 | Graphical representation of clonal evolution from the primary 
tumour to relapse in UPN 933124, and patterns of tumour evolution 
observed in eight primary tumour and relapse pairs. a, The founding clone in 
the primary tumour in UPN 933124 contained somatic mutations in 
DNMT3A, NPM1, PTPRT, SMC3 and FLT3 that are all recurrent in AML and 
probably relevant for pathogenesis; one subclone within the founding clone 
evolved to become the dominant clone at relapse by acquiring additional 
mutations, including recurrent mutations in ETV6 and MYO18B, and a 
WNK1-WAC fusion gene. HSC, haematopoietic stem cell. b, Examples of the 
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containing all of the cluster 5 mutations was detected in the relapse 
sample; clone 5 evolved from clone 4, but gained 78 new somatic 
alterations after sampling at day 170. As all mutations in clone 5 appear 
to be present in all relapse tumour cells, we suspect that one or more of 
the mutations in this clone provided a strong selective advantage that 
contributed to relapse. The ETV6 mutation, the MYO18B mutation, 
and/or the WNK1-WAC fusion are the most likely candidates, as 
ETV6, MYO18B and WAC are recurrently mutated in AML. 

We evaluated the mutation clusters in the seven additional primary 
tumour-relapse pairs by assessing peaks of allele frequency using 
kernel density estimation (Supplementary Fig. 11 and Supplemen- 
tary Information). We thus inferred the numbers and malignant frac- 
tions of clones in each primary tumour and relapse sample. Similar to 
UPN 933124, multiple mutation clusters (2-4) were present in each of 
the primary tumours from four patients (UPN 869586, UPN 426980, 
UPN 452198 and UPN 758168). However, only one major cluster was 
detected in each of the primary tumours from the three other patients 
(UPN 804168, UPN 573988 and UPN 400220) (Fig. lc and Sup- 
plementary Table 10). Importantly, all eight patients gained relapse- 
specific mutations, although the number of clusters in the relapse 
samples varied (Fig. 1). 
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two major patterns of tumour evolution in AML. Model 1 shows the dominant 
clone in the primary tumour evolving into the relapse clone by gaining relapse- 
specific mutations; this pattern was identified in three primary tumour and 
relapse pairs (UPN 804168, UPN 573988 and UPN 400220). Model 2 shows a 
minor clone carrying the vast majority of the primary tumour mutations 
survived and expanded at relapse. This pattern was observed in five primary 
tumour and relapse pairs (UPN 933124, UPN 452198, UPN 758168, UPN 
426980 and UPN 869586). 
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Two major patterns of clonal evolution were detected at relapse 
(Fig. 2b and Supplementary Fig. 3): in cases with pattern 1, the dominant 
clone in the primary tumour gained additional mutations and evolved 
into the relapse clone (UPN 804168, UPN 573988 and UPN 400220). 
These patients may simply be inadequately treated (for example, elderly 
patients who cannot tolerate aggressive consolidation, like UPN 
573988), or they may have mutations in their founding clones (or 
germline variants) that make these cells more resistant to therapy 
(UPN 804168 and UPN 400220). In patients with pattern 2, a minor 
subclone carrying the vast majority (but not all) of the primary tumour 
mutations survived, gained mutations, and expanded at relapse; a sub- 
set of primary tumour mutations was often eradicated by therapy, and 
were not detected at relapse (UPN 758168, UPN 933124, UPN 452198, 
UPN 426980 and UPN 869586). Specific mutations in a key subclone 
may contribute to chemotherapy resistance, or the mutations import- 
ant for relapse may be acquired during tumour evolution, or both. 
Notably, in cases 426980 and 758168, a second primary tumour clone 
survived chemotherapy and was also present at relapse (Fig. 1c and 
Supplementary Fig. 3). Owing to current technical limits in our ability 
to detect mutations in rare cells (mostly related to currently achievable 
levels of coverage with whole genome sequencing), our models repres- 
ent a minimal estimate of the clonal heterogeneity in AML. 

All eight patients received cytarabine and anthracycline for induc- 
tion therapy, and additional cytotoxic chemotherapy for consolidation; 
treatment histories are summarized in Supplementary Table 2 and 
described in Supplementary Information. To investigate the potential 
impact of treatment on relapse mutation types, we compared the six 
classes of transition and transversion mutations in the primary tumour 
with the relapse-specific mutations in all eight patients (Fig. 3a). 
Although C*G->TeA transitions are the most common mutations 
found in both primary and relapse AML genomes, their frequencies 
are significantly different between the primary tumour mutations 
(51.1%) and relapse-specific mutations (40.5%) (P= 2.99 x 10 7). 
Moreover, we observed an average of 4.5%, 5.3% and 4.2% increase 
in AeT>CeG (P=9.13 X10’), CeG>AsT (P=0.00312) and 
CeG—GeC (P=0.00366) transversions, respectively, in relapse- 
specific mutations. Notably, an increased Ae T—CeG transversion rate 
has also been observed in cases of chronic lymphocytic leukaemia with 
mutated immunoglobulin genes'’. CeG—>AsT transversions are the 
most common mutation in lung cancer patients who were exposed to 
tobacco-borne carcinogens’ (Fig. 3b and Supplementary Table 11). We 
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Figure 3 | Comparison of mutational classes between primary tumours and 
relapse samples. a, Fraction of the primary tumour and relapse-specific 
mutations in each of the transition and transversion categories. b, Transversion 
frequencies of the primary tumour and relapse-specific mutations from eight 
AML tumour and relapse pairs. 456 relapse-specific mutations and 3,590 
primary tumour mutations from eight cases were used for assessing statistical 
significance using proportion tests. 
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examined the 456 relapse-specific mutations and 3,590 primary tumour 
point mutations from all eight cases as a group, and found that the 
transversion frequency is significantly higher for relapse-specific 
mutations (46%) than for primary tumour mutations (30.7%) 
(P=3.71X 10 "), indicating that chemotherapy has a substantial 
effect on the mutational spectrum at relapse. Similar results were 
obtained when we limited the analysis to the 213 mutations that had 
0% variant frequency in the primary tumour samples (Supplementary 
Fig. 1b); the transversion frequency for relapse-specific mutations was 
50.4%, versus 31.4% for primary tumour samples (P = 3.89 X 10”). 
Very few copy-number alterations were detected in the eight relapse 
samples, suggesting that the increased transversion rate is not asso- 
ciated with generalized genomic instability (Supplementary Fig. 12). 

We first described the use of deep sequencing to define precisely the 
variant allele frequencies of the mutations in the AML genome of case 
933124 (ref. 3), and here have refined and extended this technique to 
examine clonal evolution at relapse. The analysis of eight primary 
AMLand relapse pairs has revealed unequivocal evidence for a common 
origin of tumour subpopulations; a dominant mutation cluster repre- 
senting the founding clone was discovered in the primary tumour 
sample in all cases. The relationship of the founding clone (and sub- 
clones thereof) to the ‘leukaemia initiating cell’ is not yet clear— 
purification of clonal populations and functional testing would be 
required to establish this relationship. We observed the loss of primary 
tumour subclones at relapse in four of eight cases, suggesting that some 
subclones are indeed eradicated by therapy (Figs 1 and 2 and Sup- 
plementary Fig. 3). Some mutations gained at relapse may alter the 
growth properties of AML cells, or confer resistance to additional 
chemotherapy. Regardless, each tumour displayed clear evidence of 
clonal evolution at relapse and a higher frequency of transversions that 
were probably induced by DNA damage from chemotherapy. 
Although chemotherapy is required to induce initial remissions in 
AML patients, our data also raise the possibility that it contributes to 
relapse by generating new mutations in the founding clone or one of its 
subclones, which then can undergo selection and clonal expansion. 
These data demonstrate the critical need to identify the disease-causing 
mutations for AML, so that targeted therapies can be developed that 
avoid the use of cytotoxic drugs, many of which are mutagens. 

This study extends the findings of previous studies'?*', which 
recently described patterns of clonal evolution in ALL patients using 
fluorescence in situ hybridization and/or copy number alterations 
detected by SNP arrays, and it enhances the understanding of genetic 
changes acquired during disease progression, as previously described 
for breast and pancreatic cancer metastases **. Our data provide 
complementary information on clonal evolution in AML, using a 
much larger set of mutations that were quantified with deep sequen- 
cing; this provides an unprecedented number of events that can be 
used to define precisely clonal size and mutational evolution at relapse. 
Both ALL and AML share common features of clonal heterogeneity at 
presentation followed by dynamic clonal evolution at relapse, includ- 
ing the addition of new mutations that may be relevant for relapse 
pathogenesis. Clonal evolution can also occur after allogeneic trans- 
plantation (for example, loss of mismatched HLA alleles via a uni- 
parental disomy mechanism), demonstrating that the type of therapy 
itself can affect clonal evolution at relapse**”’. Taken together, these 
data demonstrate that AML cells routinely acquire a small number of 
additional mutations at relapse, and suggest that some of these muta- 
tions may contribute to clonal selection and chemotherapy resistance. 
The AML genome in an individual patient is clearly a ‘moving target’; 
eradication of the founding clone and all of its subclones will be 
required to achieve cures. 


METHODS SUMMARY 


Illumina paired-end reads were aligned to NCBI build36 using BWA 0.5.5 (http:// 
sourceforge.net/projects/bio-bwa/). Somatic mutations were identified using 
SomaticSniper* and a modified version of the SAMtools indel caller. Structural 


26 JANUARY 2012 | VOL 481 | NATURE | 509 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


variations were identified using BreakDancer’. All predicted non-repetitive so- 
matic SNVs, indels and all structural variants were included on custom sequence 
capture arrays from Roche Nimblegen. Illumina 2 X 100-bp paired-end sequen- 
cing reads were produced after elution from capture arrays. VarScan® and a read 
remapping strategy using Crossmatch (P. Green, unpublished data) and BWA 
were used for determining the validation status of predicted SNVs, indels and 
structural variants. A complete description of the materials and methods is pro- 
vided in Supplementary Information. All sequence variants for the AML tumour 
samples from eight cases have been submitted to dbGaP (accession number 
phs000159.v4.p2). 
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Exercise-induced BCL2-regulated autophagy is 
required for muscle glucose homeostasis 


Congcong He!*3*, Michael C. Bassik*+*, Viviana Moresi’, Kai Sun”°, Yongjie Wei”, Zhongju Zou’, Zhenyi An"?, Joy Loh’, 
Jill Fisher*, Qihua Sun'’, Stanley Korsmeyer*}, Milton Packer’, Herman I. May’, Joseph A. Hill’, Herbert W. Virgin’, 
Christopher Gilpin’, Guanghua Xiao®, Rhonda Bassel-Duby”, Philipp E. Scherer”® & Beth Levine’?3° 


Exercise has beneficial effects on human health, including protec- 
tion against metabolic disorders such as diabetes’. However, the 
cellular mechanisms underlying these effects are incompletely 
understood. The lysosomal degradation pathway, autophagy, is an 
intracellular recycling system that functions during basal conditions 
in organelle and protein quality control’. During stress, increased 
levels of autophagy permit cells to adapt to changing nutritional and 
energy demands through protein catabolism’. Moreover, in animal 
models, autophagy protects against diseases such as cancer, neuro- 
degenerative disorders, infections, inflammatory diseases, ageing 
and insulin resistance**. Here we show that acute exercise induces 
autophagy in skeletal and cardiac muscle of fed mice. To investigate 
the role of exercise-mediated autophagy in vivo, we generated 
mutant mice that show normal levels of basal autophagy but are 
deficient in stimulus (exercise- or starvation)-induced autophagy. 
These mice (termed BCL2 AAA mice) contain knock-in mutations 
in BCL2 phosphorylation sites (Thr69Ala, Ser70Ala and Ser84Ala) 
that prevent stimulus-induced disruption of the BCL2-beclin-1 
complex and autophagy activation. BCL2 AAA mice show decreased 
endurance and altered glucose metabolism during acute exercise, 
as well as impaired chronic exercise-mediated protection against 
high-fat-diet-induced glucose intolerance. Thus, exercise induces 
autophagy, BCL2 is a crucial regulator of exercise- (and starvation)- 
induced autophagy in vivo, and autophagy induction may con- 
tribute to the beneficial metabolic effects of exercise. 

To investigate whether autophagy is induced by exercise, we ana- 
lysed tissues of mice that transgenically express a green fluorescent 
protein (GFP)-labelled marker of autophagosomes, GFP-LC3 (LC3 is 
also known as MAP1LC3) (ref. 7), after treadmill exercise. In both 
skeletal and cardiac muscle, autophagosome (GFP-LC3 puncta) num- 
bers increased after 30 min (~300 m) of running and reached a plateau 
at 80 min (~900 m) (Fig. la—c). This was observed in several muscle 
groups, including the vastus lateralis (thigh muscle) (Fig. la—-c), soleus, 
tibialis anterior and extensor digitorum longus (Supplementary Fig. 1). 
Exercise also resulted in biochemical evidence of skeletal and cardiac 
muscle autophagy, including conversion of the non-lipidated form of 
LC3, LC3-I, to the autophagosome-membrane-associated lipidated 
form, LC3-II, and degradation of the autophagy substrate protein 
p62 (Fig. 1d). In addition, exercise induced autophagy in other organs 
involved in glucose and energy homeostasis, such as liver and pancreas, 
including islet B-cells (Supplementary Fig. 2a—f) and (based on LC3-II 
conversion) adipose tissue (Supplementary Fig. 2g). Thus, exercise is a 
newly defined stimulus that induces autophagy in vivo. 


We found that exercise-induced autophagy in vivo involves disrup- 
tion of the BCL2-beclin-1 complex. BCL2 is an anti-apoptotic and 
anti-autophagy protein that inhibits autophagy through a direct inter- 
action with the BH3 domain of the autophagy protein beclin 1 at the 
endoplasmic reticulum*. Disruption of the BCL2-beclin-1 complex is 
crucial for stimulus-induced autophagy in mammalian cells*. Beclin 1 
immunoprecipitation with BCL2 decreased in muscle within 15 min of 
exercise and was barely detectable after 30 min (Fig. le). Dissociation 
of the BCL2-beclin-1 complex was not associated with phosphoryla- 
tion of kinases previously linked to BCL2 phosphorylation, such as 
JNK and p38 MAPK? (Supplementary Fig. 3). 

To study the physiological functions of exercise-induced autophagy 
in vivo, we constructed mutant mice that are deficient in stimulus- 
induced but not basal autophagy. As phosphorylation of three sites in 
the non-structured loop of human BCL2 is critical for stimulus- 
induced autophagy in vitro’, we generated knock-in (Bcl2“*“) mice 
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Figure 1 | Exercise induces autophagy in skeletal and cardiac muscle. 

a, b, Representative images of GFP-LC3 puncta (autophagosomes) in skeletal 
(vastus lateralis) (a) and cardiac (b) muscle from GFP-LC3 transgenic mice at 
serial time points after exercise. Scale bar, 20 um. c, Quantification of data 
(mean + s.e.m. of 10 tissue sections) ina and b. **P < 0.01, ***P < 0.001 (one- 
way ANOVA). d, Western blot analysis of LC3-I/II (non-lipidated and 
lipidated forms of MAP1LC3, respectively) and p62 levels in indicated tissue 
from mice at rest (—) or after maximal exercise (+). Skeletal and cardiac 
indicate skeletal and cardiac muscle, respectively. e, Co-immunoprecipitation 
of beclin 1 with BCL2 in muscle tissue from mice at indicated time points after 
exercise. IP, immunoprecipitate; WB, western blot; WCL, whole cell lysates. 
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lacking three conserved phosphorylation residues in the non-structured 
loop region of mouse BCL2: Thr 69, Ser 70 and Ser 84 (homologous to 
human Ser 87) were replaced by alanines (BCL2 AAA) (Supplementary 
Fig. 4a-c). Homozygous Bcl2“*“ mice were viable and fertile, born in 
the expected Mendelian ratio, of normal size and weight, and displayed 
normal histology of major organs (data not shown). Steady-state levels 
of BCL2 AAA in Bcl2**4 mice were similar to those of wild-type (WT) 
BCL2 in Bcl2“* mice in skeletal and cardiac muscle, liver, adipose tissue 
and pancreas (Supplementary Fig. 4d). 

Murine embryonic fibroblasts (MEFs) derived from BCL2 AAA mice 
showed a defect in stimulus-induced autophagy (Fig. 2a, b). In response 
to starvation, BCL2 underwent phosphorylation, BCL2 dissociated from 
beclin 1, and autophagy was induced in MEFs from isogenic wild-type 
mice. However, in BCL2 AAA MEFs, BCL2 phosphorylation was absent 
(as measured by p32 labelling), the BCL2-beclin-1 complex was not 
disrupted, and less autophagy was induced. Levels of basal autophagy 
were similar in wild-type and BCL2 AAA MEFs. Similar findings were 
observed in vivo in skeletal and cardiac muscle (Supplementary Fig. 5); 
autophagosome numbers were similar at baseline in wild-type and BCL2 
AAA mice expressing GFP-LC3 but failed to increase in GFP-LC3 
BCL2 AAA mice in response to 48 h starvation. 

To evaluate whether BCL2 AAA mice are deficient in exercise- 
induced autophagy, we exercised GFP-LC3 wild-type mice and 
GFP-LC3 BCL2 AAA mice for a fixed time and fixed distance 
(80 min (~900m)) and at 75% of their maximal running capacity 
(see Fig. 3a). Under both conditions, BCL2 AAA mice displayed 
marked impairment of exercise-induced upregulation of skeletal and 
cardiac muscle (as measured by numbers of GFP-LC3 puncta) (Fig. 2c, 
d and Supplementary Fig. 1b). BCL2 AAA mice also exhibited less 
exercise-induced LC3-II conversion and p62 degradation in skeletal 
and cardiac muscle (Fig. 2e), impaired autophagic responses in liver 
and pancreatic B-cells (Supplementary Fig. 2c, f), and defective exercise- 
induced dissociation of the BCL2-beclin-1 complex in muscle (Fig. 2f). 
Thus, non-phosphorylatable BCL2 does not alter basal autophagy in 
vivo, but prevents autophagy activation in response to starvation and 
exercise. This blockade of autophagy activation was not associated with 
increased cell death (Supplementary Fig. 6). 


The deficient exercise-induced autophagy in BCL2 AAA mice was 
accompanied by lower maximal exercise capacity (Fig. 3a). BCL2 AAA 
mice had similar levels of serum lactate after maximal exercise, similar 
baseline muscle strength, and similar baseline cardiac function and 
size as wild-type mice (Supplementary Fig. 7). Characterization of 
various muscle groups (soleus, tibialis anterior and extensor digitorum 
longus) from wild-type and BCL2 AAA mice at baseline and after 
80 min of exercise revealed no differences in weight, fibre cross-sectional 
area, morphology (haematoxylin and eosin staining), fibre type analysis, 
glycogen content, or mitochondrial content and functionality (Fig. 3b 
and Supplementary Figs 8-10). These analyses suggest that differences in 
baseline cardiac and skeletal muscle properties do not account for 
decreased exercise endurance in BCL2 AAA mice. 

During strenuous exercise, mammals undergo metabolic changes to 
increase skeletal muscle glucose uptake/utilization efficiency, includ- 
ing increased insulin sensitivity and redistribution of glucose trans- 
porters, such as GLUT4 (also known as SLC2A4), to the plasma 
membrane’™"', which is essential for exercise-stimulated glucose 
uptake’”. However, BCL2 AAA mice demonstrated impaired exercise- 
induced increase in insulin sensitivity, as reflected by less of a decline in 
plasma glucose (Fig. 3c) and plasma insulin (Fig. 3d) levels than in 
wild-type mice. Also, unlike wild-type mice, BCL2 AAA mice failed to 
exhibit increased plasma membrane GLUT4 localization in the vastus 
lateralis and soleus muscles following maximal or 80min exercise, 
respectively (Fig. 3e and Supplementary Figs 1la and 12a) and they 
exhibited lower levels of radiolabelled glucose uptake in soleus muscle 
(Fig. 3h). 

Adenosine monophosphate-activated protein kinase (AMPK) has a 
central role in enhanced GLUT4 plasma membrane localization and 
skeletal muscle glucose uptake during exercise’*. BCL2 AAA mice had 
a notable decrease in AMPK activation, as measured by levels of 
phosphorylation at residue threonine 172, in both the vastus lateralis 
(Fig. 3f and Supplementary Fig. 11b) and in the soleus (Supplementary 
Fig. 12b) muscles. Phosphorylation of the downstream AMPK target 
acetyl-CoA carboxylase (ACC) paralleled AMPK phosphorylation 
(Supplementary Fig. 13), providing additional support for a defect in 
exercise-induced AMPK activation in BCL2 AAA mice. 
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Figure 2 | Non-phosphorylatable BCL2 AAA knock-in mutations block 
BCL2 phosphorylation, BCL2-beclin 1 dissociation, and starvation- and 
exercise-induced autophagy. a, Analysis of BCL2 phosphorylation (detected 
by anti-BCL2 immunoprecipitation and autoradiography of *’P-labelled cells) 
and beclin 1 co-immunoprecipitation with BCL2 in wild-type (WT) or BCL2 
AAA MEFs grown in normal media or subjected to 4h Earle’s balanced salt 
solution (EBSS) starvation. p-BCL2, phospho-BCL2. b, Quantification of GFP- 
LC3 puncta (autophagosomes) in MEFs of indicated genotype in normal 
growth conditions or starvation conditions. Data represent mean + s.e.m. for 
100 cells per well of triplicate samples per condition. Similar results were 
observed in three independent experiments. c, Representative images of GFP- 
LC3 puncta (autophagosomes) in skeletal and cardiac muscle of GFP-LC3 
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wild-type and GFP-LC3 BCL2 AAA mice before exercise, after 80 min exercise, 
or after 75% of maximal exercise capacity. Scale bar, 20 um. d, Quantification of 
data (mean + s.d. of 4 mice per experimental group) in c. e, Western blot 
analysis of LC3-I/II and p62 levels in indicated tissue from mice of indicated 
genotype at rest (—) or after maximal exercise (+). Skeletal and cardiac indicate 
skeletal and cardiac muscle, respectively. f, Co-immunoprecipitation of beclin 1 
with BCL2 in muscle tissue from mice of indicated genotype at rest (—) or after 
30 min of exercise. WCL, whole cell lysates. NS, not significant. *P < 0.05, 
**P < 0.01, ***P < 0.001, one-way ANOVA for comparison between groups; 
+P <0.001, two-way ANOVA for comparison of magnitude of changes 
between different groups in mice of different genotypes. 
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Figure 3 | BCL2 AAA mice show deficient exercise endurance and 
alterations in muscle glucose metabolism. a, Maximal treadmill running 
distance for mice of indicated genotype. Data represent mean + s.e.m. of 5 mice 
per group. b, Representative haematoxylin and eosin (H & E) and periodic acid- 
Schiff (PAS) staining in tibialis anterior muscle sections from mice of indicated 
genotype. Scale bar, 20 1m. ¢, d, Plasma glucose (c) and insulin (d) levels in 
mice of indicated genotype at rest, after 80 min exercise (~900 m), or maximal 
exercise. Data represent combined mean = s.e.m. for 9-11 mice per group from 
three independent cohorts; similar results were observed in each cohort. 

e, Representative images of GLUT4 immunofluorescence staining in vastus 
lateralis muscle of mice of indicated genotype before and after maximal 
exercise. Scale bar, 20 ttm. For b and e, similar results were observed in 3 mice 
per group. f, Western blot analysis of AMPK phosphorylation (p-AMPK 
(Thr 172)) in vastus lateralis muscle lysates from mice of indicated genotype at 
indicated time after exercise. g, Maximal treadmill running distance for mice of 
indicated genotype. Data represent mean + s.e.m. of 4-6 male mice per group. 
h, Soleus muscle '*C-deoxyglucose uptake during treadmill exercise in mice of 
indicated genotype. Data represent mean + s.e.m. of 3 mice per group. 

*P< 0.05, **P < 0.01, ***P < 0.001, one-way ANOVA for comparison 
between groups; P< 0.05, ++P < 0.01, two-way ANOVA for comparison of 
magnitude of changes between different groups in mice of different genotypes. 
NS, not significant. 
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No differences were observed in phosphorylation of CaMK1, a 
downstream target of the Ca?*-stimulated kinase CaMKKf (which 
is upstream of AMPK in certain contexts)'* (Supplementary Fig. 13b), 
suggesting that differences in levels of AMPK activation between wild- 
type and BCL2 AAA mice are not due to altered Ca” homeostasis. 
They are also probably unrelated to possible effects of BCL2 on 
mitochondrial function, because phosphorylated BCL2 resides at the 
endoplasmic reticulum’ and markers of mitochondrial function did 
not differ in the muscles of BCL2 AAA and wild-type mice (Sup- 
plementary Fig. 10). Differences in insulin signalling molecules also 
did not explain the altered glucose homeostasis in exercised BCL2 
AAA mice, because at late time points during exercise, decreased 
(not increased) AKT activation and mTOR activation was observed 
in muscle of both wild-type and BCL2 AAA mice (Supplementary 
Fig. 13a). No differences were observed in muscle phosphorylation 
of the AKT substrate AS160 (also known as TBC1D4), or levels of 
the muscle-specific forkhead transcription factor target atrogin 1 (also 
known as FBXO32), in response to exercise or between genotypes. 

To confirm that the observed defects in exercise-induced endurance 
and muscle glucose metabolism are due to defective autophagy, we 
examined mice with allelic loss of the autophagy gene beclin 1 (Becn1), 
which have decreased beclin 1 protein expression and decreased 
starvation-induced autophagy in skeletal muscle’. Similar to BCL2 
AAA mice, Becn1*’~ mice exhibited deficient exercise-induced 
autophagy in skeletal muscle (Supplementary Fig. 14a, b), normal 
muscle strength (Supplementary Fig. 14c), decreased maximal 
treadmill running distance (Fig. 3g), impaired exercise-induced 
GLUT4 plasma membrane localization (Supplementary Fig. 14d) 
and muscle glucose uptake (Fig. 3h), and lower levels of exercise- 
induced muscle AMPK activation (phospho-AMPK and its target, 
phospho-ACC) (Supplementary Fig. 14e) than wild-type controls. 
The similarity of the phenotypes of BCL2 AAA and Becn1*'~ mice 
provides strong support for a role of deficient beclin 1 activity, rather 
than other BCL2-regulated functions, in the impairment of exercise 
endurance, glucose uptake and AMPK activation in BCL2 AAA mice. 

AMPK activates autophagy in response to low energy conditions by 
sensing the cellular ratio of AMP to ATP”. Deficient AMPK activation 
in BCL2 AAA and Becn1*’~ mice during exercise raised the possibility 
that, similar to observations in vitro’, the autophagy pathway may 
function in vivo in a feed-forward manner in AMPK activation. To 
evaluate whether impaired exercise-induced AMPK activation in vivo 
is due to deficient autophagy, rather than potential autophagy- 
independent effects of the BCL2 AAA mutation or monoallelic loss 
of beclin 1, we examined exercise-induced AMPK activation in 
Atgl6l1"™ mice that are hypomorphic for ATGI6L1 (ref. 17), an 
essential autophagy protein that specifies the LC3 lipidation site for 
autophagosomal membrane biogenesis'®. Atg1611"™ mice had a sig- 
nificant reduction in ATG16L1 protein expression in skeletal muscle 
and a defect in exercise-induced autophagy in muscle lysates after 
80 min of exercise (Supplementary Fig. 15a, b). This defect in exercise- 
induced autophagy was associated with decreased exercise-induced 
AMPK phosphorylation and ACC phosphorylation (Supplementary 
Fig. 15c). 

Together, our in vivo studies in BCL2 AAA, Becn1*'~ and 
Atgi6l1™ mice suggest that cellular autophagy function is partially 
required for normal levels of exercise-induced muscle AMPK activa- 
tion. The mechanism underlying this requirement is not yet known, but 
is unlikely to be caused by non-specific effects of the mouse mutations 
on AMPK activation as the direct AMPK activator, the AMP analogue 
5-aminoimidazole-4-carboxamide riboside (AICAR), induced AMPK 
activation normally in BCL2 AAA, Becn1*’~ and Atgi6l1'"™ MEFs 
(Supplementary Fig. 16). 

We next examined whether exercise-induced autophagy is required 
for the beneficial metabolic effects of long-term exercise training, using a 
high-fat diet (HFD) model of obesity and impaired glucose tolerance*”. 
Age-matched cohorts of wild-type and BCL2 AAA mice were fed a HFD 
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for four weeks; the HFD was continued for an additional 8 weeks either 
combined with 50 min of daily treadmill exercise or with no exercise 
(Supplementary Fig. 17a). In both wild-type and BCL2 AAA mice, HFD 
led to body weight gain (Supplementary Fig. 17b), which was probably 
due to increased fat mass (Supplementary Fig. 18a), as no differences in 
lean mass or muscle fibre size were observed (Supplementary Fig. 18). 
HED led to modest suppression of baseline autophagy in skeletal and 
cardiac muscle as measured by levels of LC3-II conversion and p62 in 
tissue lysates (Supplementary Fig. 19). Ultrastructural analyses of the 
liver and pancreas did not reveal any differences in HFD versus regular 
diet animals, in wild-type versus BCL2 AAA mice, or in non-exercised 
versus exercised animals, except for hepatic lipid droplet accumulation 
in all HFD-fed groups (Supplementary Fig. 20). Although BCL2 AAA 
mice were more sensitive to HFD-induced obesity, 8 weeks of daily 
exercise reduced the magnitude of weight gain similarly in BCL2 
AAA and wild-type mice (Supplementary Fig. 17b). Thus, the BCL2 
AAA mutation did not alter the response of mice to HFD with respect to 
muscle fibre size, the morphology of liver and pancreas, or the effect of 
exercise on HFD-induced obesity. 

However, BCL2 AAA mice failed to exhibit normal exercise- 
induced protection against HFD-induced impaired glucose tolerance. 
Before the administration of HFD, wild-type and BCL2 AAA mice had 
similar clearance rates in oral glucose tolerance tests (Fig. 4a). Four 
weeks of HFD led to impaired glucose tolerance in both genotypes 
versus regular diet control groups (Fig. 4b); the magnitude of this 
impairment was similar in BCL2 AAA and wild-type mice despite 
more weight gain in the BCL2 AAA mice. However, exercise training 
markedly improved glucose tolerance in HFD-fed wild-type mice but 
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Figure 4 | Long-term exercise training protects wild-type but not BCL2 
AAA mice from HFD-induced glucose intolerance. a, b, Oral glucose 
tolerance test (OGTT) before (a, week 0) and after (b, week 4) 4 weeks of HFD. 
c,d, OGTT (c) and serum leptin and adiponectin levels (d) after 8 weeks of daily 
exercise. For a-d, results represent the mean + s.e.m. for 4-5 mice per group. E, 
exercise; NE, no exercise; RD, regular diet .*P < 0.05, **P < 0.01, (c, one-way 
ANOVA; d, Wilcoxon rank test). NS, not significant. 
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not in autophagy-deficient BCL2 AAA mice (Fig. 4c and Supplemen- 
tary Fig. 21). This failure of BCL2 AAA mice to show improved glucose 
tolerance is unlikely to be due to deficient insulin production, because 
at the end of the study HFD-fed wild-type and BCL2 AAA mice dis- 
played similar circulating fasting insulin levels (Supplementary Fig. 22a), 
similar levels of insulin secretion in response to oral glucose challenge 
(Supplementary Fig. 22b), and similar pancreatic B-cell morphology 
(Supplementary Fig. 20b). These results suggest that BCL2-regulated 
functions are essential for chronic exercise-mediated protection against 
HFD-induced glucose intolerance. We speculate that, as in acute 
exercise, this may involve the dynamic interplay of autophagy induc- 
tion and AMPK activation. 

HFD-induced obesity is associated with several other metabolic 
changes, including increased levels of serum leptin, an appetite- 
inhibiting adipokine;” reduced levels of circulating adiponectin”', an 
antidiabetic adipokine*~*; and increased serum triglycerides and 
cholesterol. In HFD-fed mice, baseline levels of leptin were higher in 
BCL2 AAA mice than in wild-type mice, and 8 weeks of exercise 
decreased serum leptin in wild-type but not in BCL2 AAA mice 
(Fig. 4d). Although adiponectin levels in HFD-fed mice were similar 
in the non-exercised wild-type and BCL2 AAA groups, they were sig- 
nificantly increased in the exercised wild-type mice versus the exercised 
BCL2 AAA mice (Fig. 4d). Both serum triglycerides and cholesterol 
increased in wild-type and BCL2 AAA HFD-fed mice; with exercise, 
the values returned to those observed in animals fed a regular diet in 
wild-type but not BCL2 AAA mice (Supplementary Fig. 23). Thus, the 
BCL2 AAA mutation impaired the beneficial effects of exercise on 
metabolic changes induced by a HFD. 

The HFD-fed, daily exercised wild-type mice were more metabolically 
active than their BCL2 AAA counterparts, as demonstrated by elevated 
levels of oxygen consumption, CO, production and heat genera- 
tion during a 12h night interval when the mice were not exercised 
(Supplementary Fig. 24a). In addition, in vastus lateralis muscle, 
wild-type mice had elevated induction of messenger RNA for UCP1, 
Ucp1, a mitochondrial uncoupling protein mainly expressed in brown 
adipose tissue that contributes to thermogenesis and energy expend- 
iture’> (Supplementary Fig. 24d). These changes were probably due to 
improved metabolic fitness in response to exercise rather than altera- 
tions in food intake, spontaneous physical activity, or general 
mitochondrial function. No differences were observed in daily food 
intake in exercised HFD-fed wild-type and BCL2 AAA mice (Sup- 
plementary Fig. 24b). Spontaneous physical activity was similar in 
HFD-fed non-exercised wild-type and BCL2 AAA mice, although there 
was a trend towards decreased spontaneous physical activity in the 
exercised BCL2 AAA mice (Supplementary Fig. 24c). No changes were 
observed with exercise in either genotype in mRNA levels of the skeletal 
muscle-expressed uncoupling proteins UCP2 or UCP3, or the mito- 
chondrial proteins cytochrome b and c (Supplementary Fig. 24d, e). 

The HFD study suggests that increased autophagy triggered by 
exercise may be critical for improving impaired glucose tolerance 
and metabolism in diet-induced obesity. However, we cannot definitely 
conclude that lack of exercise-induced improvement in glucose 
tolerance in HFD-fed BCL2 AAA mice is caused by deficient exercise- 
induced autophagy; it is possible that other effects of the BCL2 AAA 
mutation are responsible for this phenotype. Nonetheless, given our 
findings in acute exercise (which demonstrate impaired muscle glucose 
uptake, GLUT4 plasma membrane localization and AMPK activation 
in autophagy-deficient animals), it seems plausible that alterations in 
exercise-induced skeletal muscle glucose metabolism in autophagy- 
deficient animals may also contribute to the failure of exercise to reverse 
HFD-induced metabolic abnormalities. 

Our findings demonstrate that exercise is a potent inducer of 
autophagy, and that acute and chronic exercise enhances glucose 
metabolism in mice capable of inducing autophagy but not in 
autophagy-deficient mice. These beneficial metabolic effects (as well 
as exercise- and starvation-induced autophagy) are blocked by a 
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mutation in BCL2 that prevents its release from an inhibitory inter- 
action with the autophagy protein beclin 1. Thus, BCL2 has previously 
undescribed essential roles in the in vivo regulation of stimulus- 
induced autophagy as well as glucose metabolism. We propose that 
BCL2-regulated autophagy activation contributes to the beneficial 
metabolic effects of exercise, and that manipulation of the autophagy 
pathway and/or the function of the autophagy inhibitory BCL2 protein 
may be a logical strategy to mimic the health effects of exercise and to 
prevent or treat impaired glucose metabolism. More broadly, on the 
basis of this newly discovered link between exercise, autophagy and 
altered metabolism, we speculate that autophagy may represent a 
cellular mechanism by which exercise prolongs life and protects 
against cancer, cardiovascular disorders and inflammatory diseases’. 


METHODS SUMMARY 

Mouse strains. GFP-LC3 transgenic’, Becn1 */~ (ref. 15) and Atgléll HM (ref. 17) 
mice have been described. The generation of mice with a knock-in mutation in 
Bcl2 that results in Thr69Ala, Ser70Ala and Ser84Ala mutations in BCL2 (BCL2 
AAA) is described in Methods. 

Exercise studies. Acute and chronic exercise studies were performed using a 
treadmill protocol described in Methods. 

Autophagy assays. Autophagy was measured by visualizing GFP-LC3 puncta by 
fluorescence microscopy in MEFs or tissue sections from mice with transgenic 
expression of GFP-LC3, or by western blot analysis of LC3-II conversion and p62 
levels in tissue lysates. 

Metabolic analyses. Metabolic and blood parameters in the acute and chronic 
exercise studies were measured as described in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Mouse strains. GFP-LC3 transgenic, BecnI*’~ and Atgi6l1'™ mice have been 
previously described”’*’’. For the construction of a mouse strain with knock-in 
mutations in the phosphorylation sites in the non-structured loop of BCL2 
(Bcl2**“ mice), BAC clones (Incyte) were screened for the presence of Bcl2 using 
the following primers: BCL2 5’, GIGGGGCGGGAGTCGGGACT; BCL2 3’, 
GACCCAGAATCCACTCACAC. The Bcl2 BAC clone was digested by BglII, 
subcloned into pSP72, and a puromycin resistance marker flanked by FRT sites 
(SalI fragment of pPGKPuro, cloned into pFRT) was blunt cloned into a BsaBI site 
3’ of exon IL A mutantallele was generated by subcloning a PstI fragment from the 
coding region. Amino acid residues Thr 69, Ser 70 and Ser 84 (homologous to 
human Ser 87) were changed to alanine in two steps using the Quickchange 
PCR (Stratagene) and the following primer sets: TSAA 5’, GAGATGGCTGCCA 
GGGCGGCTCCTCTCAGGCCCC; TSAA 3’, GGGGCCTGAGAGGAGCC 
GCCCTGGCAGCCATCTG; $84A 5’, GCTGGGCCTGCGCTCGCCCC TGTGC 
CACCATG; S84A 3’, CATGGTGGCACAGGGGCGAGCGCAGGCCCAGC. 
The targeting construct 7.1AAA was electroporated into RW4 embryonic stem 
cells (129 Sv/J), and 36h later, clones were selected with puromycin, picked and 
screened by Southern blot analysis with the probes indicated in Supplementary 
Fig. 4a. A 1.7kb shift upwards indicative of the mutant allele was detected on 
Southern blots in 2/300 clones as the targeted locus (Supplementary Fig. 4b). 
These clones were tested for normal karyotype and used to inject blastocysts 
from C57BL/6] donors. Mice with germline transmission were bred to mice 
expressing FLP from the f-actin promoter (Jackson Laboratories: B6; 
SJLTgN(ACTFLPe)9205Dym), and offspring were screened for deletion of the 
puromycin selection marker. To screen for the presence of the mutation following 
excision of the selectable marker, PCR with 5’ and 3’ Bcl2 primers was performed, 
and digestions were performed with Bgll or AatII enzymes; Bgll cuts the amplified 
fragment only if the T69AS70A site is present, while AatlII cuts only the wild-type 
sequence for the same region (Supplementary Fig. 4c). Using this scheme, 
homozygous Bcl2*** mutants were identified by the presence of PCR fragments 
cleaved only by Bgll and not AatII, whereas wild-type mice were identified by the 
presence of PCR fragments cleaved only by AatII and not Bgll. Bcl2*** mice were 
backcrossed for more than ten generations to C57/B6 mice (Jackson Laboratories), 
and homozygous Bcl2“** and Bcl2" offspring were used in all studies. 
Cell lines. Primary MEFs were established either from Bcl2**“ and Bcl2“”* mice, 
the offspring of Bcl2*** and Bcl2" mice crossed with GFP-LC3 transgenic mice, 
Becn1*'~ and Becn1*'* mice, and Atgi6l1"™ and Atg16l1" at embryonic day 
13.5 and cultured as described”. 
Radiolabelling and co-immunoprecipitation from MEFs. MEFs derived from 
Bcl2“*4 and Bcl2“' mice were cultured overnight in labelling medium (phosphate- 
free DMEM with 10% dialysed FBS) and then cultured in starvation medium (EBSS 
minus phosphate) or labelling medium for 4h containing 2mCiml' *P- 
orthophosphate. Cells were lysed in lysis buffer containing 50 mM Tris (pH 7.9), 
150 mM NaCl, 1 mM EDTA, 1% Triton X-100, proteinase inhibitor cocktail (Roche 
Applied Sciences) and halt phosphatase inhibitor cocktail (Thermo Scientific), and 
were subjected to immunoprecipitation with a monoclonal anti-BCL2 antibody 
(Santa Cruz Biotechnology, 1:50). Eluates were separated by SDS-PAGE and 
detected by anti-beclin 1 antibody (Santa Cruz Biotechnology, 1:200), anti- 
BCL2-HRP antibody (C2, Santa Cruz Biotechnology, 1:100) and autoradiography. 
Co-immunoprecipitations from muscle tissue. Vastus lateralis (thigh muscle) 
was dissected and homogenized in lysis buffer containing 50 mM Tris (pH 7.9), 
150mM NaCl, 1mM EDTA, 1% Triton X-100, proteinase inhibitor cocktail 
(Roche Applied Sciences) and halt phosphatase inhibitor cocktail (Thermo 
Scientific), and subjected to immunoprecipitation with a monoclonal anti-BCL2 
antibody (Santa Cruz Biotechnology, 1:50). Eluates were separated by SDS-PAGE 
and detected by anti-beclin 1 antibody (Santa Cruz Biotechnology, 1:200) and anti- 
BCL2-HRP antibody (C2, Santa Cruz Biotechnology, 1:100) and autoradiography. 
Mouse exercise studies. For acute exercise studies, 8-week-old (wild type, BCL2 
AAA and Atgl 6l1™™) or 12-week-old (Becn1*/~ and wild-type Becn1*/* 
littermate) mice were acclimated to and trained on a 10° uphill Exer 3/6 open 
treadmill (Columbus Instruments) for 2 days. On day 1 mice ran for 5 min at 
8mmin _' and on day 2 mice ran for 5 min at 8mmin’ | followed by another 
5 min at 10mmin *. On day 3, mice were subjected to a single bout of running 
starting at the speed of 10m min‘. Forty minutes later, the treadmill speed was 
increased at a rate of 1!mmin every 10 min for a total of 30 min, and then 
increased at the rate of l1mmin | every 5min until mice were exhausted. 
Exhaustion was defined as the point at which mice spent more than 5s on the 
electric shocker without attempting to resume running. Total running time was 
recorded and total running distance was calculated for each mouse. 

For long-term exercise training, male Bcl2" or Bcl2*** mice (8 weeks old) 
were randomly divided into three cohorts, including: (1) mice fed a regular diet 
(Harlan Teklad) without daily exercise, (2) mice fed a HFD containing 60% fat 


(Research Diets) without daily exercise, and (3) mice fed a HFD with daily exercise. 
Prior to initiation of exercise, mice were fed a HFD for 4 weeks. At the end of the 
fourth week, mice in the exercise groups were acclimated to treadmill running for 2 
days as in the acute exercise studies, and then were trained on the treadmill with 
10° uphill incline for 50 mind” ', 5 d week! at 17m min‘ for 8 weeks. Mice were 
given a HFD during the 8-week training period. All animal procedures were 
performed in accordance with institutional guidelines and with approval from 
the Institutional Animal Care and Use Committee. 

Autophagy analyses. MEFs expressing GFP-LC3 were cultured in normal or 
starvation (EBSS, Earle’s balanced salt solution) medium for 4h, and GFP-LC3 
puncta were quantified by fluorescence microscopy as described’’. For assessment 
of autophagy in vivo following starvation or exercise, 8-week-old Bcl2 or Bcl2*** 
GFP-LC3 mice or Becn1*'* or Becn1*'~ GFP-LC3 mice were either subjected to 
starvation for 48h or exercised for the indicated time period, anaesthetized by 
isoflurane, and perfused with 4% paraformaldehyde (PFA). Skeletal muscle (vastus 
lateralis, tibialis anterior, extensor digitorum longus and soleus), heart (left 
ventricle), pancreas and liver tissues were fixed in 4% PFA overnight, 15% sucrose 
for 4h and 30% sucrose overnight before frozen sections were prepared. The 
number of GEP-LC3 puncta per unit area of tissue was quantified by fluorescence 
microscopy as described’’. Autophagy in skeletal and cardiac muscle, liver, 
pancreas and adipose tissue during baseline conditions and after exercise was also 
analysed by western blot analysis of tissue extracts with antibodies against LC3 and 
p62 (see below for details). 

Immunofluorescence studies. For immunofluorescence staining of frozen muscle 
and pancreatic sections, slides were heated at 50 °C for 10 min, rehydrated in 100%, 
95% and 70% ethanol, washed 3 X 5 min in PBS, blocked with 1% goat serum and 
immunostained with a rabbit anti-mouse GLUT4 antibody (Alpha Diagnostic, 
1:100 dilution) (muscle samples) and an Alexa Fluor 594 donkey anti-rabbit 
secondary antibody (Invitrogen, 1:1,000 dilution) or with a guinea pig anti-swine 
insulin antibody (Dako, 1:500 dilution) (pancreatic samples) and an Alexa Fluor 
594 goat anti-guinea pig secondary antibody (Invitrogen, 1: 500 dilution). 
Immunofluorescence studies on skeletal muscle sections with anti-laminin and 
anti-type I myosin heavy chain antibodies were performed as previously 
described’*. Immunofluorescence images were taken using a Zeiss Axioplan2 
microscope or Leica TCS SP5 confocal microscope. 

Muscle morphology and function studies. Sections were stained with haematoxylin 
and eosin to visualize tissue architecture following standard protocols. 
Metachromatic ATPase staining and measurement of fibre cross-sectional area 
were performed as previously described”*. Periodic acid-Schiff staining for carbo- 
hydrates was performed according to standard protocols”. Succinic dehydrogenase 
staining on cryosections was carried out using a 0.2 M phosphate buffer at pH 7.6. 
For staining, 270 mg succinic acid and 10 mg nitro blue tetrazolium were freshly 
dissolved in 10 ml phosphate buffer and incubated with sections for 1h. 

To measure muscle strength, a modified mesh grip assay was applied”°. Briefly, 
mice were placed on the wired mesh of cage tops. When they firmly grabbed the 
mesh, the cage top was flipped and held at a height of 30 cm above the bench until 
the mice released their grip. The time for each mouse to remain on the mesh was 
recorded. TUNEL staining of muscle sections to detect apoptosis was performed 
according to the manufacturer's instructions (ApopTag Peroxidase In situ 
Apoptosis Detection Kit, Millipore), using Sigma FAST 3,3'-diaminobenzidine 
(DAB) tablets as the peroxidase substrate. 

Electron microscopy. Mice were euthanized, and liver and pancreas were rapidly 
fixed in 2.5% glutaraldehyde in 0.1 M cacodylate buffer. Electron microscopy was 
performed as previously described*'. 

Mouse metabolic studies and blood tests. Food intake, O, consumption, CO, 
production, heat production and physical activity were measured in TSE 
metabolic chambers (TSE Systems) over a 4 day period. Mice in the exercise group 
were removed from the chambers and allowed to run for 50 min daily as in the 
above protocol during the day (light period). Metabolic measurements were per- 
formed for 12h daily during the dark period. Fat mass and lean tissue mass were 
determined using the Bruker Minispec mq10 scan system. Plasma glucose, lactate, 
cholesterol and triglycerides were measured using the Vitros 250 system (Ortho- 
Clinical Diagnostics). Plasma insulin (Crystal Chem), serum leptin (Crystal 
Chem) and adiponectin (Millipore) levels were determined using commercial 
ELISA kits. Glucose tolerance tests were performed by oral glucose injection 
el gkg ') after a 3h fast. Blood was drawn from tail veins at the indicated time 
points after glucose injection and serum glucose levels were analysed using 
commercial glucose assay reagents (Sigma, Cat# 6918, P7119, F5803). 
Echocardiography. Cardiac function was assessed by echocardiography, using a 
Visualsonic Vevo 2100 ultrasound machine equipped with a 30-Mhz transducer 
applied to the chest wall. Ventricular dimensions and ejection fraction were 
assessed using short axis view in two-dimensional and three-dimensional modes 
using the onboard VisualSonics cardiac analysis package. 
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Muscle glucose uptake assays. Surgical catheterization of the jugular vein was 
performed on each mouse. At day 4 after catheterization, mice were transferred to 
clean cages at 9:00 a.m. to begin a 4h fast. Mice then ran on a treadmill for 50 min 
using the exercise protocol described above. At t = 50 min, a 13 1Ci bolus of Ce 
deoxyglucose was injected intravenously and mice resumed running for another 
25min according to the exercise protocol described above before euthanasia. 
Skeletal muscle (soleus) and brain tissues were dissected and homogenized in 
1.5 ml 0.5% perchloric acid. '*C-deoxyglucose counts were determined in homo- 
genized samples as previously described” and muscle values were normalized to 
the counts in the brain. 

Western blot analyses. Mouse tissue extracts were prepared by homogenizing 
tissues in lysis buffer containing 50mM Tris (pH 7.9), 150mM NaCl, 1mM 
EDTA, 1% Triton X-100, proteinase inhibitor cocktail (Roche Applied Sciences) 
and halt phosphatase inhibitor cocktail (Pierce), and subjected to western blot 
analysis with anti-LC3 (Novus Biologicals, 1:500), anti-p62 (BD Biosciences, 1:500 
dilution; Progen (C-terminal specific), 1:1,000 dilution). Anti-p-CaMKI (Santa 
Cruz Biotechnology, 1:100), anti-CaMKI (Santa Cruz Biotechnology, 1:300), 
anti-BCL2 (Santa Cruz Biotechnology, 1: 100), anti-AMPK (Cell Signaling, 
1:1,000), anti-p-AMPK (Cell Signaling, 1:1,000), anti-JNK (Cell Signaling, 
1:400), anti-p-JNK (Cell Signaling, 1:500) anti-AS160 (Millipore, 1:400) anti-p- 
AS160 (Millipore, 1:100), anti-4E-BP1 (Cell Signaling, 1:1,000), anti-p-4E-BP1 
(Cell Signaling, 1:1,000), anti-p-ACC (Millipore, 1:500) anti-AKT (Cell 
Signaling, 1:600), anti-p-AKT (Cell Signaling, 1:500), anti-p38 (Cell Signaling, 
1:500), anti-p-p38 (Cell Signaling, 1:400), anti-atrogin-1 (ECM Biosciences, 
1:400), ATG16L1 (Sigma, 1:1,000) and anti-actin (Santa Cruz Biotechnology, 
1:3,000) antibodies. 

For in vitro assessment of AMPK activation, MEFS derived from Bcl. AA and 
Bcl2" mice, Becn1*/~ and Benc1‘'* mice, and Atg16l1'™ and AtgI6l1" mice 
were treated with 2 mM of the AMP analogue AICAR, or DMSO vehicle for 2h. 
Whole-cell lysates were prepared, and AMPK and phospho-AMPK levels were 
determined by western blot analysis. For quantification, the relative intensity 
values of western blot bands were normalized to that of the first lane (set as 1) 
in the wild-type non-exercised group. 

Statistical analyses. ANOVA approaches were used to compare values among 
different experimental groups for data that met the normality assumption. One- 
way ANOVA was used for comparison between two groups. Two-way ANOVA 
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was used for comparison of the magnitude of changes between two different 
groups in mice of two different genotypes. The normality assumption for the 
ANOVA model was checked using residual plots. When the assumption was 
violated, the data were log-transformed in order to meet the assumption. For data 
sets in which log transformation was inadequate to meet the analysis assumption, 
the non-parametric Wilcoxon rank-sum test was used. 

Real-time PCR analyses. RNA isolation from muscle tissues and real-time RT- 
PCR were performed as previously described’*. The following Sybr Green primers 
were used: f-actin: forward, CTGGCTCCTAGCACCATGAAGAT; reverse, 
GGTGGACAGTGAGGCCAGGAT; UCP1: forward, TCAGGATTGGCCTCTA 
CGAG; reverse, TAAGCCGGCTGAGATCTTGT; UCP2: forward, T@CCCGTA 
ATGCCATTGTC; reverse, AGTGGCAAGGGAGGTCATCT; cytochrome c: 
forward, GI'CT'GTTCGGGCGGAAGACAG; reverse. GGGGAGAGGATAC 
CCTGATGG; and cytochrome b: forward, ATTCATTGACCTACCTGCCC; 
reverse, TCTGATGTGTAGTGTATGGC. The following Tagman primers were 
used: PPARGCla (PGC1a): Mm00447183_m1; myoglobin: Mm00442968_m1; 
PPARGCI1B (PGCI1B): Mm01258518_ml; MYH7 (MHC Type 1D: 
Mm00600555_ml; MYH1 (MHC Type IId/x): Mm01332488_g1; MYH2 (MHC 
Type Ila): Mm01332564_m1; and GAPDH: Mm99999915_ gl. 
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Multi-isotope imaging mass spectrometry quantifies 
stem cell division and metabolism 


Matthew L. Steinhauser’?, Andrew P. Bailey*, Samuel E. Senyo'?, Christelle Guillermier”*°, Todd S. Perlstein’, Alex P. Gould’, 


Richard T. Lee’*® & Claude P. Lechene*** 


Mass spectrometry with stable isotope labels has been seminal in 
discovering the dynamic state of living matter’”, but is limited to 
bulk tissues or cells. We developed multi-isotope imaging mass 
spectrometry (MIMS) that allowed us to view and measure stable 
isotope incorporation with submicrometre resolution**. Here we 
apply MIMS to diverse organisms, including Drosophila, mice and 
humans. We test the ‘immortal strand hypothesis’, which predicts 
that during asymmetric stem cell division chromosomes contain- 
ing older template DNA are segregated to the daughter destined to 
remain a stem cell, thus insuring lifetime genetic stability. After 
labelling mice with '"N-thymidine from gestation until post-natal 
week 8, we find no '°N label retention by dividing small intestinal 
crypt cells after a four-week chase. In adult mice administered '*N- 
thymidine pulse-chase, we find that proliferating crypt cells dilute 
the '°N label, consistent with random strand segregation. We 
demonstrate the broad utility of MIMS with proof-of-principle 


a DIC 15N:14N b 


Parallel 


Sequential 


18N-thymidine 18C-thymidine 


ae ee 


studies of lipid turnover in Drosophila and translation to the 
human haematopoietic system. These studies show that MIMS 
provides high-resolution quantification of stable isotope labels 
that cannot be obtained using other techniques and that is broadly 
applicable to biological and medical research. 

MIMS combines ion microscopy with secondary ion mass spectro- 
metry (SIMS), stable isotope reporters and intensive computation 
(Supplementary Fig. 1). MIMS allows imaging and measuring of stable 
isotope labels in cell domains smaller than one micrometre cubed. We 
tested the potential of MIMS to track DNA labelling quantitatively with 
’N-thymidine in vitro. In proliferating fibroblasts, we detected label 
incorporation within the nucleus by an increase in the '°N:'N ratio 
above natural ratio (Fig. 1a). The labelling pattern resembled chromatin 
with either stable isotope-tagged thymidine or thymidine analogues 
(Fig. 1b). We measured dose-dependent incorporation of N-thymidine 
over three orders of magnitude (Fig. 1c and Supplementary Fig. 2). We 
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Figure 1 | MIMS quantification of stable isotope-labelled thymidine 
incorporation by dividing cells. a, Dividing fibroblast labelled with '°N- 
thymidine. The cell surface was sputtered to reach the nuclei. Left, differential 
interference contrast reflection microscopy. Middle, '*N image revealing 
subcellular details including the nucleus (white arrows) with nucleoli. Right, 
hue saturation intensity (HSI) image mapping the '°N:'4N ratio. The rainbow 
scale ranges from blue, set to natural ratio (0.37%, expressed as 0% above 
natural ratio), to red, where the ratio is several fold above natural ratio (700%, 
eight times the natural ratio). '*N labelling is concentrated in the nucleus. Scale 
bars, 10 jum. b, Fibroblast nuclei after serial or parallel DNA labelling with TNE 
thymidine, '*C-thymidine, BrdU (°'Br) or iododeoxyuridine (1771). Parallel 
labelling (top row): colocalization of label confirmed by the merged image (far 


right). Sequential labelling (bottom row): non-superimposable nuclear 
labelling. Scale bars, 5 um. c, Concentration-dependent nuclear 'SN-labelling in 
'°N-thymidine treated fibroblasts. Pixel-by-pixel quantification of N- 
incorporation. Data are derived from the mean of intranuclear pixels. 
Sigmoidal dose response curve: R* = 0.99. d, Nuclei from '°N-thymidine- 
labelled human foreskin fibroblasts after 24-h chase (cycle approximately 18 h). 
Scale bars, 5 um. e, One group labelled similarly to control cells (undivided), the 
other with labelling that was approximately half that of control (divided). 

f, °N:'4N HST image of the small intestine. '"N-thymidine labelling (one week) 
of nuclei extends from the crypts to the tips of the villi. Mosaic: 8 tiles, 80 Lm 
each. Scale bar, 30 tum. g, Sigmoidal dose response curve: 'SN-thymidine 
labelling after single subcutaneous injection (R? = 0.99). 
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also tracked fibroblast division after a 24-h label-free chase (Fig. 1d, e 
and Supplementary Fig. 3). Cells segregated into two populations, one 
indistinguishable from control cells suggesting no division, the other 
with halving of label, consistent with one division during the chase. 

We found similar results by tracking cell division in vivo in the small 
intestine (Fig. 1f, g and Supplementary Figs 4-6). We measured dose- 
dependent '°N-thymidine incorporation within nuclei of actively 
dividing crypt cells (Fig. 1g and Supplementary Fig. 4), down to a dose 
of 0.1 ug per g (Supplementary Fig. 2). The cytoplasm was slightly 
above natural ratio, probably due to low-level soluble °N-thymidine 
or mitochondrial incorporation (Supplementary Fig. 2). We measured 
halving of label with each division during label-free chase (Supplemen- 
tary Fig. 6). 

We then tested the ‘immortal strand hypothesis’, a concept that 
emerged from autoradiographic studies” and that predicted long-term 
label retaining cells in the small intestinal crypt*’. It proposes that 
asymmetrically dividing stem cells also asymmetrically segregate 
DNA, such that older template strands are retained by daughter cells 
that will remain stem cells and newer strands are passed to daughters 
committed to differentiation (Supplementary Fig. 7)*°. Modern studies 
continue to argue both for*”” or against’*'® the hypothesis, leading to 
the suggestion that definitive resolution of the debate will require a new 
experimental approach”. 

Although prior evidence suggests a concentration of label-retaining 
cells in the +4 anatomic position”*, we searched for DNA label reten- 
tion irrespective of anatomic position or molecular identity. We 
labelled mice with °N-thymidine for the first 8 weeks of life when 
intestinal stem cells are proposed to form®. After a 4-week chase, mice 
received bromodeoxyuridine (BrdU) for 24h before euthanasia to 
identify proliferating cells (Fig. 2a and Supplementary Fig. 8, Experi- 
ment 1), specifically crypt base columnar (CBC) cells and transit amp- 
lifying cells (Supplementary Fig. 9), which cycle at a rate of one and two 
times per 24h, respectively'* (Supplementary Fig. 10). All crypt cell 
nuclei were highly labelled upon completion of '°N-thymidine; after a 
4-week chase, however, we found no label retention by non-Paneth 
crypt cells (Fig. 2b-f; n = 3 mice, 136 crypts analysed). '°N-labelling in 
N*/BrdU- Paneth and mesenchymal cells was equivalent to that 
measured at pulse completion (Fig. 2b, c) indicating quiescence during 
the chase (values above '°N:'4N natural ratio: Paneth pulse = 107.8 + 
5.0% s.em. m=51 versus Paneth pulse-chase = 96.3 + 2.8% s.e.m. 
n= 218; mesenchymal pulse = 92.0+5.0% s.em. n= 89 versus 
mesenchymal pulse-chase = 90.5 + 2.2% s.e.m. n = 543). The number 
of randomly selected crypt sections was sufficient to detect a frequency 
as low as one label-retaining stem cell per crypt irrespective of ana- 
tomic location within the crypt. Because each anatomic level contains 
approximately 16 circumferentially arrayed cells*, a two-dimensional 
analysis captures approximately one eighth of the cells at each ana- 
tomic position (one on each side of the crypt; Supplementary Fig. 9a). 
Therefore, assuming only one label-retaining stem cell per crypt we 
should have found 17 label-retaining cells in the 136 sampled crypts 
(one eighth of 136) but we found 0 (binomial test P< 0.0001). The 
significance of this result held after lowering the expected frequency of 
label-retaining cells by 25% to account for the development of new 
crypts, a process thought to continue into adulthood”. In three addi- 
tional experiments, using shorter labelling periods and including in 
utero development, we also found no label-retaining cells in the crypt 
other than Paneth cells (Supplementary Fig. 8, Experiments 2-4). 

To address the possibility that long-term thymidine exposure or 
frequent high-dose injections introduced pitfalls, we limited labelling 
to the previously reported peak time period of label-retaining stem cell 
formation® and reduced the dose by 50-fold compared to prior experi- 
ments (Supplementary Fig. 8, Experiment 5). Approximately six label- 
retaining cells per intestinal circumference were observed previously* 
with a similar protocol; with about 90 crypts per circumference, this 
translates to 1 label-retaining cell per 15 sectioned crypts. We analysed. 
330 crypts and observed 19 '°N*/BrdU~ Paneth cells (one per 17 crypt 
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Figure 2 | No label-retaining stem cells in the small intestinal crypt. a, '°N- 
thymidine administered from post-natal day 4 to week 8. After 4-week chase, 
BrdU was administered (500 j1g intraperitoneally every 6h) for 24h before 
euthanasia (see Supplementary Fig. 8). b, ‘“N: crypt structure and intense signal 
in intracytoplasmic Paneth granules at the crypt base. *'P: intense intranuclear 
signal. **S: intense signal within cytoplasmic Paneth cell granules. *'Br: direct 
measure of BrdU incorporation. ‘°N:"*N: HSI image of ‘°N-thymidine labelling 
within '°N*/BrdU~ Paneth cells (large arrow) and mesenchymal cells (small 
arrows). No other cells reveal °N retention. Large arrow head: recently formed 
(BrdU~/'°N7 ) Paneth cell. Small hatched arrow (middle left side of the crypt): 
unlabelled Paneth cell, (“°N~/BrdU_ ). Scale bar, 15 jm. c, Continued analysis 
of the same crypt after narrowing the acquisition field. High '°N signal in a 
BrdU Paneth cell (large arrow) and mesenchymal cells closely associated with 
the crypt (small arrows). BrdU* CBC (hatched arrow) and Paneth cell (arrow 
head) are ISN. Scale bar, 5 jum. d, NN HSI image of small intestinal crypt 
at the end of °N-thymidine pulse. All nuclei are labelled. Nuclei with lesser 
degrees of labelling likely represent cells born during a period of lower 
circulating °N-thymidine as expected given the different labelling protocols 
(Supplementary Fig. 8). Scale bar, 20 um. e, Unlabelled mouse image. The entire 
crypt contains the natural abundance of 'SN. Scale bar = 10 jim. f, Mean % 
'NT cells at the completion of pulse and after pulse-chase ( + standard error of 
CBC, transit amplifying (TA), Paneth and mesenchymal (Mes) cells (n = 3 
mice per group). 


sections), but no non-Paneth label-retaining cells (expected, 21; 
observed, 0; binomial test P<0.0001) or doubly labelled (°N*/ 
BrdU") cells irrespective of cellular identity. Thus, after labelling dur- 
ing all potential periods of stem cell formation, spanning in utero 
through post-natal development, we found no label-retaining stem 
cells in the small intestinal crypt other than Paneth cells, which are 
largely quiescent. We conclude therefore that the phenomenon of 
label-retaining cells as described by others was most likely due either 
to reagent toxicity or to insufficient resolution with the consequent 
misidentification of labelled post-mitotic Paneth cell nuclei as label 
retaining +4 stem cells. MIMS minimizes these potential artefacts due 
to its ability to detect non-radioactive isotopes and the high imaging 
resolution both laterally and vertically. Although our data cannot 
directly exclude the migration of label-retaining stem cells into the 
crypt under non-homeostatic conditions such as radiation injury'’, 
prior examinations of crypt clones argue against contributions from 
either a resting stem cell population or from stem cells originating 
outside the crypt”. 

We also studied label release in adult mice after stem cell formation. 
These experiments are particularly relevant given the recent suggestion 
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that crypt stem cells exhibit monoclonal drift over 8 months 
model that is incompatible with lifetime survival of label-retaining 
stem cells but that does not exclude the possibility of biased template 
strand segregation. After administering '"N-thymidine for 2 weeks to 
label all proliferating crypt cells, '"N-thymidine was stopped and BrdU 
was administered for 24h to label cells dividing during the °N- 
thymidine-free chase (Fig. 3a). A consequence of the immortal strand 
hypothesis would be that a stem cell would produce a daughter in 
which '°N label was completely lost with the first division during chase 
(Supplementary Fig. 7). We observed no such BrdU‘ /'°N’ crypt cells, 
even after extending the '°N-thymidine-free chase to 48 h (Fig. 3b and 
Supplementary Fig. 11). Proliferating (BrdU*) crypt cells (n = 3mice; 
563 cells analysed) diluted N-thymidine label (mean 71% above 
natural ratio) compared to undivided (BrdU ) cells (129% above natural 
ratio) or cells analysed at the end of '"N-thymidine pulse (128% above 
natural ratio), as expected of cells undergoing random chromosomal 
segregation (Fig. 3c). We also analysed crypt cell nuclei in late mitosis, 
finding '°N label and BrdU in both sets of segregating chromosomes 
(Fig. 3d and Supplementary Fig. 12, n = 232). Together, these data result 
from an analysis of 625 crypts, in which we did not find a single BrdU * 
nucleus or chromosomal complement that was not 1SN_labelled, indi- 
cating that dividing cells in the crypt uniformly dilute labelled DNA ina 
pattern consistent with random DNA strand segregation. 

The broad applicability of MIMS is demonstrated by proof-of- 
principle studies in Drosophila and humans. Drosophila provides a 
model system in which to study the genetics of lipid metabolism, in vivo. 
Due to its small size, however, it is difficult to obtain sufficient material 
from specific tissue/cell types for measurement with conventional mass 
spectrometry. A previous study showed numerous lipid droplets in the 
enterocytes of a segment of the Drosophila larval intestine, the anterior 
midgut”. We therefore used MIMS to measure the incorporation and 
turnover of diet-derived '*C-palmitate within these cells. After larval 
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Figure 3 | Label-dilution in adult mice indicates random segregation of 
DNA strands. a, 'SN-thymidine administered for 2 weeks to adult mice 
(osmotic mini-pump, 20 pg h’), then BrdU for 24h before euthanasia (See 
Supplementary Fig. 8). b, Dividing cells (BrdU~) dilute '"N-thymidine label 
(small arrows) relative to undivided cells (BrdU) (large arrows). Note two 
CBC cells with elongated nuclei at the crypt base. Scale bar, 10 tm. ¢, Divided 
crypt cells (BrdU*), residing in CBC or +4-10 positions, demonstrated Ne 
dilution consistent with one or two rounds of division during the chase (n = 3 
mice per group). d, Mitotic crypt cell. Segregating chromosomes are visible in 
"4N and *'P images. ‘°N-label and BrdU were measured in both segregating 
chromosomal complements consistent with symmetric chromosomal 
segregation. Scale bar, 2 um. 
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exposure to dietary ‘*C-palmitate, label is incorporated into enterocyte 
lipid droplets with a subcellular pattern similar to that seen with a 
neutral lipid dye (compare Fig. 4a with supplementary Fig. 13). 
Dietary '*C-palmitate pulses of varying periods indicate that the rate 
of '°C label incorporation into lipid droplets is greater in anterior 
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Figure 4 | Extending MIMS to Drosophila and human metabolism. 

a, Drosophila anterior midgut enterocytes (gut, left panels) accumulate more 
'8C from 24-h exposure to dietary '*C-palmitate than adipocytes (fat body cells 
(fb), right panels). Lipid droplets (arrows) have low nitrogen content and so 
appear dark in '*N images (top panels). The '°C:'*C HSI images reveal '*C 
incorporation into lipid droplets. Scale bars, 10 um. b, Time course of °C 
accumulation in Drosophila lipid droplets of anterior midgut enterocytes (Gut) 
and adipocytes (Fat body) over 24h of larval exposure to dietary '*C-palmitate. 
Initial rates of *C-incorporation in the lipid droplets were 9.7% h’ ' (Gut) and 
1.16% h ' (Fat body) with the first time point after pulse taken at 1 h. c, Rapid 
lipid turnover in Drosophila anterior midgut enterocytes (gut) after larval 
dietary '°C-palmitate pulse-chase. Upper panels show '‘N images and lower 
panels show corresponding '*C:'*C HSI images, indicating strong '°C 
incorporation at the end of the pulse (bottom left) and rapid elimination during 
a 4-h chase (bottom right). Scale bar = 5 jum. d, '3C tracer dilution in the lipid 
droplets of anterior midgut enterocytes during a label-free chase fits an 
exponential decay curve (R? = 0.90, half-life = 9.1 h). e, °N-thymidine 
administered for 48 h: no labelled white blood cell were found. After a 4-week 
chase we found few labelled lymphocytes. Top: unlabelled polymorphonuclear 
leukocyte (PMN) with multi-lobed nucleus. Bottom: '°N-labelled lymphocyte 
after chase. The round nucleus seen in the *'P image occupies the majority of 
the cell, as is typical for lymphocytes. Scale bars, 3 um. 


©2012 Macmillan Publishers Limited. All rights reserved 


midgut enterocytes than in the adipocytes of the fat body, the major 
lipid storage depot in Drosophila (Fig. 4a, b). Pulse-chase experiments 
with dietary '*C-palmitate also demonstrate a reproducible exponen- 
tial decay (R* = 0.9) of the ‘°C signal within enterocyte lipid droplets 
and provide an estimate for the half-life of approximately 9 h (Fig. 4c, 
d). These results demonstrate the utility of MIMS for precise measure- 
ment of in vivo lipid turnover at high resolution within individual 
Drosophila lipid droplets. 

In the first MIMS human experiment, we administered Ne 
thymidine to a healthy volunteer by intravenous infusion for 48h. 
We performed MIMS analysis on peripheral white blood cell smears. 
We found no labelled white blood cells at the end of the infusion (3,600 
analysed). After a 4-week chase we found a few labelled white blood 
cells (Fig. 4c, 4 of approximately 2,000 cells analysed) consistent with a 
lag time before release from the bone marrow (chi-squared, P < 0.01). 
Radiolabelled thymidine has been administered to humans in numerous 
studies previously”, but these studies were conducted primarily on 
cancer patients. There is no known risk to using stable isotopes”. 
Thus, this experiment now opens the door to studies of metabolism 
and cell tracking in humans. 

In conclusion, we provide direct evidence under physiological con- 
ditions against non-random template strand segregation in the small 
intestine. Our approach encompasses all potential stem cell populations 
in the crypt, those identifiable by known markers**** and putative ones 
not identified by a specific marker. We have demonstrated the power of 
MIMS to image and quantify cell metabolism in three different con- 
texts: mammalian intestinal cell division, Drosophila lipid metabolism 
and human lymphopoiesis. We anticipate that the measuring and 
imaging power of MIMS will make it a major tool to study metabolism 
and cell fate in animals and humans. 


METHODS SUMMARY 


Data were acquired with the prototype (four detectors) and a second-generation 
instrument (NanoSIMS 50L, Cameca: seven detectors). Quantitative mass images 
were analysed with Open MIMS software, a plug-in to Image] (http://www.nrims. 
harvard.edu/software.php). Stable isotopes were obtained from Cambridge 
Isotopes (°N-thymidine, C-thymidine) and Sigma-Aldrich (°C-palmitate). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Multi-isotope imaging mass spectrometry (MIMS). MIMS is based on secondary 
ion mass spectrometry (SIMS). Anionic beam sputters the surface atomic layer ofa 
sample, resulting in the ionization of a small atomic fraction. Ina SIMS instrument, 
secondary ions are separated by mass, and then used to derive a quantitative atomic 
mass image of the surface of the analysed sample. The instrument can simulta- 
neously measure data from multiple isotopes from the same region. Data included 
in this study are derived from both the NanoSims prototype (ONERA, Université 
Paris Sud/Cameca) and a second-generation instrument (NanoSims 50L, 
Cameca). The second-generation instrument is fully automated and has the capa- 
city to quantify seven masses simultaneously. 

Quantitative image data extraction. For each pixel, the number of counts of a 
given secondary ion is stored in a computer file. The data are reconstructed into a 
greyscale image in which the pixel intensity is derived from the total number of 
counts of a given secondary ion within the area representing a given pixel. The 
lateral resolution is dependent on factors, including the beam size and the number 
of pixels per image acquisition area. A series of images based on the analysis of 
successive atomic layers can be summed. The incorporation of halogenated 
nucleotide analogues (BrdU, iododeoxyuridine) is assessed by directly quantifying 
the ionized halogen atoms "Br, A/T); which are found at extremely low back- 
ground concentrations in most tissues. Halogen enrichment within a given region 
of interest is expressed as the percent enrichment over natural background. 

Hue saturation intensity ratio images. The localization of tracers enriched with 
rare stable isotopes (for example, '"N-thymidine) is done by deriving quantitative 
ratio images. The ratio of a rare stable isotope and the common stable isotope is 
quantified on a pixel by pixel basis. Incorporation of a stable isotope within a 
region of interest is determined when the ratio of the rare stable isotope to the 
common stable isotope exceeds the natural ratio. Because quantitative images 
acquired with MIMS generate far more data than can be represented by a grey- 
scale, we previously developed a method of visually representing ratio data based 
on a hue saturation intensity (HSI) transformation. With an HSI image, the scale 
can be set such that the lower bounds of the scale (blue colours) are assigned to 
pixels in which the corresponding rare-to-common stable isotope ratio is equi- 
valent to the natural ratio. An important advantage of an HSI image is that the 
scale can be extended or compressed to provide an optimal colour representation 
of regions of differential stable isotope enrichment. However, any image manip- 
ulations, such as those resulting from scale changes, do not modify the quantitative 
data underlying the colour representation. 

Data processing and export. MIMS images are viewed and processed using Open 
MIMS software, a custom-built plugin to the Image] software (http://www.nrims. 
harvard.edu/software.php). In the experiments contained herein, nuclei were 
manually identified as regions of interest within the crypts of the small intestine 
using a combination of '“N and *'P mass images. Cellular identity was assigned by 
an observer unaware of the labelling status of the cells. Paneth cells were identified 
on the basis of their characteristic cytoplasmic granules seen in ‘“N and *’S images, 
large round nuclei seen in *N and *’P images, and their anatomic location at or 
near the base of the crypt (Supplementary Fig, 9). ‘"N-labelled Paneth cells (Fig. 2 
and Supplementary Fig. 8 Experiment 1) had a number of granules with a median 
of 8 (interquartile range = 5-11, n = 218). The granules were not separated from 
the nucleus by a plasma membrane, which would have been detected by MIMS. 
CBC cells were identified on the basis of their location between Paneth cells at the 
base of the crypt and their narrow, elongated nuclei. The +4 position was assigned 


to the first non-Paneth cell superior to the Paneth zone at the base of the crypt. 
Cells in the +4 position and above (transit amplifying cells) typically had elliptical 
nuclei. 

'SN-labelling is expressed as the '°N:'4N ratio (percentage above natural ratio) 
obtained using the following equation: 


(5N :!4N ratio) — (natural ratio) 


natural ratio 


1SN :4N(% above natural ratio) x 100 
Mosaic images. The maximum field size analysed in this study was 80 tm X 80 
um. The presentation of images from larger tissue swaths is made possible by the 
construction of mosaic images in which a series of adjacent fields are analysed. 
After MIMS acquisition, the full series of tiles is assembled to form a larger mosaic 
image. 

Statistics. Data were analysed using JMP 8.0.1 and Prism 3.0 (GraphPad). Two- 
way t-tests were performed for comparisons of two experimental groups. For 
analyses of multiple experimental groups, one-way ANOVA and Bonferroni 
correction were used. Non-parametric data were analysed with a Kruskal- 
Wallis test and Dunn’s multiple comparisons test. Binomial testing was used to 
compare observed versus predicted event frequencies. 

Reagents. '°N-thymidine and '*C-thymidine were obtained from Cambridge 
Isotopes. '°N-thymidine, used for human administration, was subjected to addi- 
tional endotoxin and microbial limits testing before packaging. ‘*C-palmitate was 
obtained from Sigma-Aldrich. Bromodeoxyuridine (BrdU) and iododeoxyuridine 
(IdU) were obtained from Sigma-Aldrich. 

Cell culture. Human foreskin fibroblasts were grown in DMEM/Ham’s F12 sup- 
plemented with 10% FBS, 1% penicillin/streptomycin/L-glutamine. For in vitro 
labelling experiments, fibroblasts were grown on silicon chips. Prior to MIMS 
analysis, cells were fixed and spun dry (Headway Research Spinner, PWM32) at 
3,500 r.p.m. for 2 min. 

Mice. C57B6 mice were used in accordance with the Guide for the Use and Care of 
Laboratory Animals and approved by the Harvard Medical School Standing 
Committee on Animals. Osmotic minipumps (Alzet) were implanted subcuta- 
neously. Jejunal segments were fixed (4% paraformaldehyde) and embedded (LR 
white). Sections (0.5 um) were mounted on silicon chips. 

Drosophila. Drosophila larvae were fed a standard yeast/cornmeal diet” supple- 
mented with 1% w/v '*C-labelled palmitic acid-1 (Sigma-Aldrich). For chase 
experiments, larvae were collected at 72h of development, following a label-free 
chase period of 0-24h, using standard food supplemented with 1% unlabelled 
palmitic acid (Sigma-Aldrich). Following dissection, anterior midguts or fat bodies 
were fixed in 2% formaldehyde, post-fixed in 2% glutaraldehyde then 2% osmium 
tetroxide, embedded in Epon resin, sectioned (thickness = 2 1m), and mounted 
on silicon chips. 

Human protocol. The Partners Institutional Review Board approved the 
protocol. After obtaining informed consent, the subject was admitted to the 
Brigham and Women’s Hospital clinical research unit. Prior to administration, 
’N-thymidine was resuspended in 500ml 0.9% NaCl and passed through a 
0.2 pum filter. The subject received °N-thymidine intravenously (30 mg bolus over 
5 min, then 15mg h! for 48 h) without adverse events. Peripheral blood was 
collected in an EDTA-containing tube at the conclusion of the infusion and after 
a 4-week chase. The blood sample was centrifuged (300g, 5 min), the buffy coat 
collected, and red blood cells lysed (ACK buffer, Invitrogen). The white blood cell 
suspension was smeared onto silicon chips and fixed with 4% paraformaldehyde. 
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Multi-isotope imaging mass spectrometry reveals 
slow protein turnover in hair-cell stereocilia 
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Hair cells of the inner ear are not normally replaced during an 
animal’s life, and must continually renew components of their 
various organelles’. Among these are the stereocilia, each with a 
core of several hundred actin filaments that arise from their apical 
surfaces and that bear the mechanotransduction apparatus at their 
tips. Actin turnover in stereocilia has previously been studied” by 
transfecting neonatal rat hair cells in culture with a B-actin-GFP 
fusion, and evidence was found that actin is replaced, from the top 
down, in 2-3 days. Overexpression of the actin-binding protein 
espin causes elongation of stereocilia within 12-24 hours, also 
suggesting rapid regulation of stereocilia lengths’. Similarly, the 
mechanosensory ‘tip links’ are replaced in 5-10 hours after cleav- 
age in chicken and mammalian hair cells**. In contrast, turnover in 
chick stereocilia in vivo is much slower’. It might be that only 
certain components of stereocilia turn over quickly, that rapid 
turnover occurs only in neonatal animals, only in culture, or only 
in response to a challenge like breakage or actin overexpression. 
Here we quantify protein turnover by feeding animals with a '°N- 
labelled precursor amino acid and using multi-isotope imaging 
mass spectrometry to measure appearance of new protein. 
Surprisingly, in adult frogs and mice and in neonatal mice, in vivo 
and in vitro, the stereocilia were remarkably stable, incorporating 
newly synthesized protein at <10% per day. Only stereocilia tips 
had rapid turnover and no treadmilling was observed. Other 
methods confirmed this: in hair cells expressing f-actin-GFP we 
bleached fiducial lines across hair bundles, but they did not 
move in 6 days. When we stopped expression of B- or y-actin with 
tamoxifen-inducible recombination, neither actin isoform left the 
stereocilia, except at the tips. Thus, rapid turnover in stereocilia 
occurs only at the tips and not by a treadmilling process. 

To understand protein turnover in the inner ear, we sought a 
method that could reveal new synthesis with high spatial resolution, 
in adult animals, in vivo, and without transfection of cells. We fed frogs 
with food containing L-leucine tagged with the stable isotope '°N for 
up to 32 days, and killed them by cardiac perfusion. As leucine is an 
essential amino acid, newly synthesized protein would contain '°N in 
nearly the same proportion as the food. To locate '°N-tagged protein, 
plastic sections of frog saccules were placed onto silicon wafers or 
electron microscopy grids for multi-isotope imaging mass spectro- 
metry (MIMS) imaging. We detected nitrogen as the CN ion, 
with the MIMS detectors set for the masses of '*C, 13°C, '*C!“N and 
12C15N, and calculated the ‘°N/"4N ratio at each location in a field”*® 
(see Methods). 

A mass-26 ('*C'*N) image of the saccular macula, representing total 
protein, shows the apical surface ofa hair cell with its hair bundle (Fig. 1a, 
left). Lateral resolution approaches 30nm, and depth resolution is 
~2nm (ref. 7). The mass-27 channel ('*C'°N; not shown) revealed 
newly synthesized protein, and the mass-27/mass-26 ratio indicated 


per cent incorporation. A ratio image (Fig. la, right, 32 days of feeding) 
shows that the stereocilia incorporate new protein more slowly than 
the cell body. We quantified turnover by summing the mass-26 and 
mass-27 counts for all pixels in each region of interest (ROI) and 
calculated per cent incorporation (Fig. 1c). Hair-cell cytoplasm and 
cuticular plates incorporated new protein at ~0.5% per day. But 
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Figure 1 | Incorporation of '°N into frog saccular epithelium. a, Left, mass- 
26 image for a hair cell (HC) and supporting cells (SC) at day 32. CP, cuticular 
plate; HB, hair bundle; OM, otolithic membrane. Right, image of mass-27/ 
mass-26 ratio, revealing incorporation of °N. Colour scale represents 0-30% 
incorporation. Field is 20 X 20 um. b, Top, mass-26 image of the shortest 
stereocilium in a. Bottom, profile of incorporation along its length. c, Turnover 
for days 1, 2, 4, 8, 21 and 32. cutic, cuticular plate; cyto, cytoplasm. Data are 
shown as mean ~ standard error of the mean (s.e.m.); N was typically 8-10. 
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stereocilia incorporated new protein at <0.3% per day, far lower than 
the 50% per day suggested for actin in mammalian cochlea’. One 
difference is that MIMS assesses total protein; perhaps actin turns over 
rapidly but most other proteins are very stable. However, actin is 50- 
60% of the total protein in stereocilia?’® and would account for the 
majority of the MIMS signal. 

Surprisingly, incorporation seemed higher at the very tips of stereocilia 
(Fig. 1b). Incorporation at the tip almost equalled that of cytoplasm 
(Fig. 1c). In this bundle, incorporation at the tips of all stereocilia was 
12.5%, compared to 6.1% in the shafts. We also measured incorpora- 
tion in individual ROIs along the length of a stereocilium. Of 37 frog 
stereocilia, there was no correlation with height in 31 and positive 
correlation in just one, inconsistent with movement of the ISN label 
from top to bottom. 

Protein turnover in bullfrog stereocilia was much slower than that 
inferred for mouse and rat hair cells*"', leading us to wonder whether 
turnover is more rapid in mammals. We fed adult mice '"N-leucine 
and killed them after 1, 2, 8, 32, 56 or 150 days. In the mouse vestibular 
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Figure 2 | Incorporation of '°N into adult mouse hair cells. a, Mass-26 image 
of utricle; day 56. b, Mass-27/mass-26 ratio: low incorporation in stereocilia. 

c, Projection of a three-dimensional stack of a. d, Ratio image from ec: high 
turnover at tips. e, Incorporation, days 1-56. cutic, cuticular plate; cyto, 
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system, '°N incorporation was faster in the cytoplasm of hair cells and 
supporting cells (Fig. 2a, b, e; ~7% per day initially) but still slow in 
stereocilia (<3% per day initially). This was not a peculiarity of actin- 
based structures: intestinal-brush-border microvilli showed more 
incorporation than the cytoplasm, and much more than stereocilia 
(Supplementary Fig. 2). 

The MIMS instrument acquires repeated images, slowly etching the 
surface to create thousands of XY planes with a Z resolution of approxi- 
mately 1 nm allowing the creation of a three-dimensional representa- 
tion (Fig. 2c, d and Supplementary Movies 1 and 2). In mouse utricle, 
incorporation was again much higher in the tips than elsewhere in 
stereocilia. Although tip labelling would be expected in the initial phase 
of treadmilling, the high-turnover region never extended much below 
the top micrometre, which apparently represents a fixed zone that does 
not move down by treadmilling. 

We then wondered whether turnover is more rapid in cochlear than 
vestibular hair cells. Apparently not: at 8, 32, 56 and 150 days (Fig. 2f-)), 
cochlear stereocilia had slower incorporation of new protein (~2% 


15N incorporation (%) 


Length (tum) 


J 
400+ OO OHCcutic  OHC bundle 
@ IHC cutic @ IHC bundle 
A OHC cyto © TMcentre 
sok 4A IHC cyto 
& 
= 
£ 60 
g 
o 
- 
ra] 
2 40 
z 
5 


20 


120 


0 40 80 
15N feeding (days) 


cytoplasm; HC, hair cell; SC, supporting cell. f, Cochlear inner hair cell; day 56; 
mass 26. g, Ratio image. h, Three-dimensional reconstruction from 450 images. 
i, Incorporation along axis of stereocilia from h. Bar: plotted length. Line widths: 
95% confidence. j, Incorporation after 8, 32, 56 or 150 days. Colour scale 0-100%. 


26 JANUARY 2012 | VOL 481 | NATURE | 521 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


initially) than did cell bodies or cuticular plates (~3.5% initially). 
Moreover, adult stereocilia apparently have a pool of protein that is 
replaced exceedingly slowly: '°N incorporation after 5 months was 
<60%. 

A three-dimensional image of cochlear hair bundles (Fig. 2h and 
Supplementary Movie 3) also showed high turnover in the distal 
micrometres of the sterocilia. At 56 days, DN incorporation was 40% 
along most of the length of individual stereocilia, but >80% at the tips 
(Fig. 2i). Except for the tips, we did not see a gradient of incorporation. 
In addition, the resolution of MIMS revealed higher turnover in the 
region of the cell membrane than in stereocilia cores (Fig. 2i, inset), 
perhaps reflecting rapid diffusion of some membrane proteins’’. 

Elsewhere in the cochlea, ‘°N incorporation varied considerably 
(Supplementary Fig. 1). Like the frog otolithic membrane, the mouse 
tectorial membrane was exceptionally stable, with ~10% '°N incorp- 
oration in 5 months. Pillar cell shafts, containing stiff microtubule 
bundles, had just 23% incorporation. The reticular lamina, comprising 
the actin-rich cuticular plates of hair cells, terminal webs of Dieter’s 
cells and their zonulae adherens, showed low turnover. Thus, low 
turnover occurred in structures that convey sound to stereocilia. 

Protein turnover could be more rapid in neonatal mice. To test this, 
mice were moved at birth to surrogate mothers that had been fed '°N for 
24-28 days to raise the '°N content of their milk; pups were nursed for 
0-15 days before being killed. In a second group, mothers were fed '°N 
before and during pregnancy. At birth, pups were moved to 
control nurse mothers, and protein turnover was measured as ISN Joss. 
Pups nursed by '°N mothers showed a normal rise in '°N in utricular 
cytoplasm (Supplementary Fig. 3), with 30-50% incorporation in 4 
days. In stereocilia, protein turnover was slower, with 25% in 4 days 
(Supplementary Fig. 3b, g). A three-dimensional rendering also showed 
low turnover in stereocilia, except at the tips (Supplementary Fig. 3c). 

A difficulty in interpretation is that about half the hair bundles in the 
mouse utricle develop postnatally’’, so incorporation of new protein in 
the stereocilia might reflect development rather than turnover. Indeed, 
small bundles, presumably still developing, showed high incorporation 
(Supplementary Fig. 4), and were therefore excluded from the analysis 
in Supplementary Fig. 3g. Another difficulty is the assumption that 
tagged leucine is incorporated equally into proteins. Proteins with a 
lower abundance of leucine than the average (9.36%) would have a 
lower '°N/™N ratio, which might be interpreted as slow turnover. In 
fact, stereocilia proteins do have less leucine. The ten most abundant 
proteins of neonatal rat utricular stereocilia (P. Gillespie, personal 
communication), adjusted for abundance, have 8.53% leucine. This 
is not enough to account for low stereocilia incorporation, but we 
sought a different way to measure turnover. 

We therefore measured the loss of '°N in pups born to '°N-fed 
mothers but nursed by controls, normalizing '°N levels in each struc- 
ture to that at postnatal day (P)0 (Supplementary Fig. 3h). Consistent 
with some postnatal bundle development, small bundles had little or no 
label, but full-sized hair bundles retained °N for many days, with 70% 
remaining at P4 (Supplementary Fig. 3d—f, h and Supplementary Fig. 4). 

Perhaps turnover in stereocilia is more rapid in vitro. We cultured 
utricles from PO pups and added '°N-leucine after one day. Over four 
more days, hair bundles showed relatively low '°N incorporation (~8% 
per day versus 12-16% per day in cytoplasm; Supplementary Fig. 5). 

Perhaps the fluorescence previously observed’ was due to newly 
synthesized B-actin-GFP that was only a small fraction of the actin 
in the core—in essence a tracer for treadmilling, the bulk of which was 
driven by recycling of actin monomers that are only slowly replaced. 
To assess treadmilling with an alternative to MIMS, and with steady 
rather than transient expression of a tracer, we constructed mouse 
lines expressing f-actin-GFP under the myosin-7a promoter 
(Supplementary Fig. 6). Stereocilia were brightly fluorescent (Fig. 3). 
We dissected utricles at ages PO-P5 and cultured them for 1 day. Ina 
confocal microscope, we selected a field in which the epithelium was 
tilted sideways, and bleached the GFP in a 1-j1m-wide line mostly 
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perpendicular to the bundle axis, for 5-8 bundles (Fig. 3b, c). A 
three-dimensional confocal stack was acquired, and the utricle placed 
back into culture. Two, four and six days later, each utricle was returned 
for imaging (Fig. 3d). The bleached line was sometimes obscured if the 
bundle axis changed in culture, but the three-dimensional image could 
be used to reconstruct it. In all experiments, we saw no significant 
movement of the bleached line (Fig. 3f). On average, the line moved 
by 2 + 3% of the bundle height in 6 days (Fig. 3g). 

It is possible that our cultures were not healthy, and that treadmill- 
ing was impaired. However, in the same utricles, we observed the 
appearance and growth of new bundles. Moreover, both bleached 
and unbleached hair cells took up the fluorescent dye RH414, a marker 
for functional transduction (data not shown). Thus an independent 
method gave no evidence of treadmilling on a timescale of days. 

The GFP tag could have inhibited treadmilling in Myo7a::B-actin:GFP 
mice. We therefore studied treadmilling by terminating B- or y-actin 
expression and following the redistribution of actin with antibodies. 
Mice expressing a tamoxifen-inducible Cre under the B-actin promoter 
were crossed with mice carrying either a floxed B- or y-actin gene™. At 3 
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Figure 3 | Tracking treadmilling by bleaching GFP-tagged f-actin. 

a, Neonatal utricular macula in culture, hair cells expressed -actin-GFP. 
Bleached bundles are indicated by asterisks. b, Projection of confocal planes 
before and after bleaching. D1, day 1. c, Three-dimensional stack rotated to 
show oblique bleached line. d, Same bundle on days (D)3, 5 and 7. A shift in 
bundle orientation partially obscured the line. e, Rotation of the day-7 stack 
revealed the bleached line. f, Line positions for three bundles, as a proportion of 
bundle height. g, Average movement of bleached lines, as a proportion of 
bundle height. N = 24; mean ~ s.e.m. 
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weeks of age, after bundles had developed, mice were given tamoxifen to 
promote excision. One, three, eighteen or thirty-four weeks later, we fixed 
and removed cochleas, and used antibodies specific for B- and y-actin. 

In control mice without tamoxifen, B- and y-actin were uniformly 
distributed (Fig. 4a). If there is rapid treadmilling in adults, B-actin 
protein should be lost from stereocilia soon after excision prevented 
B-actin gene expression. Similarly, y-actin should be lost in mice with a 
floxed y-actin gene. This occurred, but only for a 0.4-~m region at 
stereocilia tips (Fig. 4a, b). The rest of the stereocilium retained the 
deleted actin isoform, and the region of isoform loss did not progress 
towards stereocilia bases, even after 34 weeks. 

The MIMS method provides a new way to study protein stability in the 
inner ear’. MIMS is based on Schoenheimer’s pioneering use of stable 
isotopes to reveal the dynamics of cellular components’, but adds imaging 
with exceptional spatial resolution. '°N incorporation can be calculated 
with an accuracy that depends only on acquisition time. It does not rely 
on tagging any one protein species, or transfecting a specific cell type. 

In contrast to previous studies”, MIMS revealed unexpectedly low 
turnover of protein in hair-cell stereocilia. Incorporation of '°N in 
stereocilia occurred at <0.3% per day in frog, <2% per day in adult 
mouse and <9% per day in neonatal mice, incompatible with tread- 
milling in 2-3 days. By bleaching fluorescently labelled actin in bun- 
dles, we saw no significant movement of the actin. Finally, excision of 
the B- and y-actin genes in mature mice showed loss of the excised 
actin from the stereocilia tips, but not a progressive loss from tip to 
base. We found no evidence for treadmilling of stereocilia cores, over 
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Figure 4 | Loss of actin from tips following knockout of actin genes. 

a, b, Adult (3-week-old) cochlear inner hair cells, labelled with antibodies to 
B-actin (green) or y-actin (red). a, Floxed f-actin mouse crossed to Cagg- 
CreER, without tamoxifen (control), or 1, 3 or 34 weeks (wk) after tamoxifen. 
Loss of f-actin occurred only at tips of stereocilia. In control mice, a few hair 
cells displayed loss of B-actin at tips without tamoxifen, suggesting a leaky 
promoter. b, Floxed y-actin mouse 18 weeks after tamoxifen. Loss of y-actin 
occurred only at tips of stereocilia. 
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weeks or even months. It is possible that the previously observed 
treadmilling was produced by overexpression of B-actin, if the exo- 
genous actin significantly exceeds endogenous actin. Or—at least in 
the vestibular system—addition of actin to hair bundles developing 
postnatally might have confused interpretation. 

Nevertheless, there is a conspicuously dynamic turnover of protein 
at the tips of stereocilia, in the distal 0.3-0.5 um. This could represent 
turnover of transduction components, which can be replaced in hours 
when damaged*”. It could represent movement of proteins to the tips 
by hair-cell myosins, most of which climb from bases to tips. Or it may 
reflect a rapid regulation of actin filaments, which can change length 
by ~0.1 jum when tip links are cut’. If so, there might be both poly- 
merization and depolymerization at the barbed end. Rapid but local 
regulation of filament length is not inconsistent with the limited 
growth of stereocilia induced by overexpression of espin’. 

In long-lived vertebrates, the actin of stereocilia must be replaced. It 
may be that treadmilling does occur, but at a timescale far longer than 
that studied here. It may instead be that individual actin filaments do 
not extend the length of the stereocilium—that free ends can accept or 
shed monomers and that unsynchronized treadmilling occurs on a 
submicroscopic scale. Unlike filopodia, stereocilia seem to develop 
once, elaborating an intricate staircase of heights that persists for the 
life of an animal; in most of their length, they then replace individual 
proteins slowly and without disrupting this exquisite structure. 


METHODS SUMMARY 


Tissue preparation and '°N feeding. Adult bullfrogs (Rana catesbeiana) were 
maintained on a !°N-leucine diet, then fixed and treated with EDTA to dissolve 
otoconia before embedding and sectioning. 

Adult mice (Mus musculus, CD1 strain) were maintained on a '°N-leucine diet 
before being killed and processed. For neonatal mice, mothers were maintained on 
'°N food before and during pregnancy; at birth, the pups were moved to a control 
surrogate mother. A second set was born to control mothers and moved at birth to 
a surrogate mother that had been maintained on '°N food. 

Utricles were dissected, cultured overnight in DMEM-F12 with 1 or 6% FBs'® 
and maintained in '"N-enriched medium for up to 5 days. 

MIMS analysis. MIMS was performed as described previously’. '°N incorpora- 
tion was calculated by comparing counts for masses 27 and 26 to the natural 
abundance of '°N (0.37%) and the relative abundance of '°N in the food (1-2%). 
Photobleaching of cultured mouse utricles. We constructed mice expressing a 
-actin-GFP fusion under the myosin-7a promoter, integrated in the HPRT locus 
on the X chromosome (see Supplementary Fig. 6). Utricles from neonatal pups 
(mosaic for the transgene) were cultured as above. Hair bundles were bleached on 
day 1; images were taken on days 3, 5 and 7. 

Conditional deletion of B- and y-actin. Actg#"”* and Act”* mice, in which exons 
3 and 4 were flanked by loxP sites'* were crossed to mice expressing Cre/Esr1 
recombinase driven by the chicken B-actin promoter coupled to the cytomegalo- 
virus immediate-early enhancer. Mice were dosed with tamoxifen to induce 
recombination. Anti-y-actin antibody clone 1-37 (ref. 15) IgG was conjugated 
to Alexa fluorescent dyes. FITC-labelled anti-B-actin antibodies (clone AC-15) 
were obtained from Abcam. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Tissue preparation and '°N feeding. Adult bullfrogs (Rana catesbeiana) were fed 
an unlabelled control diet for 2-3 days, and were transferred at day 0 toa '°N-rich 
diet. Food pellets contained '°N-labelled leucine so that the °N content was 
1.28%. Bullfrogs were fixed via cardiac perfusion with 4% formaldehyde + 0.1% 
glutaraldehyde. After decapitation, the saccules were exposed and the head and 
carcass stored in 4% formaldehyde + 1% glutaraldehyde. Heads were transferred 
to PBS before dissection of the saccules. Saccules were then treated with EDTA for 
2-10 days to dissolve otoconia before embedding in Epon and sectioning. 

Adult mice (Mus musculus, CD1 strain) ate control food that contained no 
added 1°N, and then were switched to food with !°N-leucine (either 1.28 or 
1.23%) for up to 150 days. Neonatal mice were labelled in two ways. For one 
set, mothers were fed !°N food for 4-8 days, mated, and continued on ISN food 
until pups were delivered (‘°N mother). At birth, the pups were moved to a 
surrogate mother that had been maintained on control food (‘4N mother). A 
second set was born to ‘*N mothers and moved at birth to a surrogate mother 
that had been maintained for 24-28 days and was continued on '°N food. Pups 
were nursed by surrogate mothers for 0, 2, 4, 8 or 15 days before being killed and 
fixed. At least two animals (four inner ears) were prepared for each condition. 

Cultured utricles were dissected by standard methods’’. After removing the 
otolithic membranes, the maculae were plated on a sterile portion of ACLAR film 
(EMS) and incubated in DMEM-F12 containing 100 wg ml carbenicillin, with 
either 1 or 6% FBS. After an overnight incubation, the medium was replaced with 
'SN-enriched medium (DN, Cambridge Isotopes, 2.44 or 1.35% ISN), The cultures 
were then incubated at 37 °C for up to 5 days until fixation. Tissues were fixed in 
4% formaldehyde + 0.1% glutaraldehyde in cacodylate buffer for 15 min at 22 °C, 
then overnight at 4 °C. Cultures were then moved to cacodylate buffer and held for 
2-3 days at 4°C, before processing in batches for plastic embedding, as above. 
MIMS analysis. In the multi-isotope imaging mass spectrometer’, a ~30nm 
beam of Cs* ions, scanned across the sample, sputters atoms, atomic clusters or 
molecules from the surface, of which a small fraction are ionized. Negative 
secondary ions are accelerated back, then shaped with ion optics and analysed 
with a double sector mass spectrometer employing four detectors, each tuned to a 
specific mass. Because N) is not charged, '°N and '*N were detected as cyanide 
anions, '*C!°N~ and !*C'4N, mass 27 and mass 26, respectively, and their ratio 
was calculated for each point in the raster scan. 

We calculated the proportion of newly synthesized protein, for each pixel of the 
field, from the ratio of counts in the '*C’°N and '*C@N images according to 


Nx 
‘ é Ny —An 
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where N37 and Ny. are the counts in each pixel for masses 27 and 26, A, is the 
natural abundance of '°N (0.37%), and Aris the relative abundance of '°N in the 
food (1-2%). Incorporation was displayed with a hue saturation intensity trans- 
formation (HSI) in which the hue corresponds to the ratio value, and the intensity 
is an index of statistical reliability*. Image analysis performed in Image]/ 
OpenMIMS (NIH/NRIMS). 3D reconstructions made in ImageVis3D (NIH/ 
NCRR SCI/CIBC). 

Photobleaching of cultured mouse utricles. To label endogenous actin, we con- 
structed mice expressing a B-actin-GFP fusion under the myosin-7a promoter, 
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integrated in the HPRT locus on the X chromosome. Correct integration of the 
BAC was confirmed Southern blot using flanking probes. Details of the construc- 
tion, using methods previously described”’, are in Supplementary Fig. 6. 

Neonatal pups (heterozygous and therefore mosaic for the transgene) were 

killed between PO and P4, their utricles dissected and cultured as above. 
Dissection was performed on experimental day 0, then stereocilia were bleached 
on day 1 and images were taken on experimental days 3, 5 and 7. Each imaging day, 
the cells were transferred to DMEM-F12 without phenol red and then to the stage 
of an Olympus FV1000 confocal (upright configuration, LUMFL X60 immersion 
objective, 1.1 NA) with a warmed chamber. On day 1, a Z-stack of the selected field 
containing 20-50 hair cells was collected using confocal imaging of the GFP with a 
488 nm laser. Target hair bundles were selected and bleached with one to three 
300-ms flashes of a focused 405 nm laser at 20 ps pixel’. Each selected bundle in 
the field was separately bleached. After bleaching, another Z-stack for the whole 
field was collected to confirm the bleached locations. The following experimental 
days one Z-stack per field was acquired. Movement of the bleached line was 
assessed by rotating the image stack to observe the line relative to the full length 
of the bundle. 
Conditional deletion of B- and y-actin. The Actg#”™ and Actb"* mice, in which 
exons 3 and 4 were flanked by /oxP sites'* were crossed to transgenic mice expres- 
sing Cre/Esr1 recombinase driven by the chicken f-actin promoter coupled to the 
cytomegalovirus immediate-early enhancer (Jackson Labs; stock number 004682). 
All mice were on the C57BL/6 background. Mice were dosed with tamoxifen once 
a day for 4 days to induce recombination (20 mg ml! in corn oil, administered as 
0.15 mg tamoxifen per gram of mouse by oral gavage). Esr1 control of Cre recom- 
binase seemed to be leaky, in that 18-week-old mice that had not been given 
tamoxifen had the same actin-distribution phenotype as mice given tamoxifen; 
both were different from mice lacking CAG-Cre/Esr1. The phenotype was seen in 
a few cells of 3-week-old mice even in the absence of tamoxifen, so we restricted 
quantitative analysis to mice dosed with tamoxifen at 3 weeks. 

Monoclonal mouse anti-y-actin antibody clone 1-37 IgG (ref. 15) was purified 
from ascites using a T-gel Purification kit (Pierce) and conjugated to either Alexa- 
488 or Alexa-568 fluorescent dyes using a Monoclonal Antibody Labelling Kit 
(Invitrogen) following the manufacturer’s instructions. FITC-labelled anti-B-actin 
antibodies (clone AC-15) were obtained from Abcam (validated in ref. 14). 

For labelling, mice were perfused with 4% paraformaldehyde in PBS. Cochleas 
were then dissected and incubated in the same fixative for 2 h at room temperature 
(22 °C), washed in PBS and decalcified in 170 mM EDTA in PBS at 4 °C for 16h. 
The organ of Corti was dissected, postfixed in 100% methanol at —20°C for 
10 min, rinsed in PBS and permeabilized in 0.5% Triton X-100 in PBS for 
20min at room temperature. Tissue was blocked for 1h in 5% goat serum in 
PBS before incubation with the indicated antibodies. Samples were mounted in 
ProLong anti-fade reagent and viewed on a Deltavision PersonalDV deconvolu- 
tion microscope equipped with a 100 1.4 NA objective (Applied Precision). 
Stacks of images were collected at 0.20-um intervals and deconvolved using 
Resolve3d software (Applied Precision). 


17. Prosser, H. M., Rzadzinska, A. K., Steel, K. P. & Bradley, A. Mosaic complementation 
demonstrates a regulatory role for myosin Vllain actin dynamics of stereocilia. Mol. 
Cell. Biol. 28, 1702-1712 (2008). 
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Molecular recognition of a single sphingolipid species 
by a protein’s transmembrane domain 


F.-Xabier Contreras'*, Andreas M. Ernst!*, Per Haberkant't, Patrik Bjorkholm?"%, Erik Lindahl**, Basak Génen!, 
Christian Tischer°, Arne Elofsson”?, Gunnar von Heijne*’, Christoph Thiele°, Rainer Pepperkok®, Felix Wieland! & Britta Briigger! 


Functioning and processing of membrane proteins critically 
depend on the way their transmembrane segments are embedded 
in the membrane’. Sphingolipids are structural components of 
membranes and can also act as intracellular second messengers. 
Not much is known of sphingolipids binding to transmembrane 
domains (TMDs) of proteins within the hydrophobic bilayer, and 
how this could affect protein function. Here we show a direct and 
highly specific interaction of exclusively one sphingomyelin 
species, SM 18, with the TMD of the COPI machinery protein 
p24 (ref. 2). Strikingly, the interaction depends on both the head- 
group and the backbone of the sphingolipid, and on a signature 
sequence (VXXTLXXIY) within the TMD. Molecular dynamics 
simulations show a close interaction of SM 18 with the TMD. We 
suggest a role of SM 18 in regulating the equilibrium between an 
inactive monomeric and an active oligomeric state of the p24 
protein**, which in turn regulates COPI-dependent transport. 
Bioinformatic analyses predict that the signature sequence repre- 
sents a conserved sphingolipid-binding cavity in a variety of 
mammalian membrane proteins. Thus, in addition to a function 
as second messengers, sphingolipids can act as cofactors to regulate 
the function of transmembrane proteins. Our discovery of an 
unprecedented specificity of interaction of a TMD with an indi- 
vidual sphingolipid species adds to our understanding of why bio- 
logical membranes are assembled from such a large variety of 
different lipids. 

Lipidomics of Golgi-derived COPI vesicles unravelled a partial 
segregation of cholesterol and most sphingomyelin species except 
for one (N-stearoyl sphingomyelin) from the vesicle fraction’. To 
understand the molecular mechanisms of SM 18 enrichment in vesicles, 
we investigated in vivo binding of lipids to p24 and p23, membrane 
proteins involved in COPI vesicle biogenesis (Supplementary Fig. 1)°. 
Lipid crosslinking’* revealed a strong sphingolipid labelling of p24 
(Fig. 1a, b, see also Supplementary Fig. 2), with a fivefold higher recovery 
of radioactivity per mole of protein compared to p23. To analyse if 
this interaction of p24 with sphingomyelin is specific for SM 18, we 
established a liposomal assay to study TMD-lipid interactions (see 
Supplementary Fig. 3). Maltose-binding protein (MBP) fusions of the 
TMDs of p23 and p24 were analysed in a liposomal Forster resonance 
energy transfer (FRET) system for their interaction with pentaenoyl-SM 
18 (SM 18:5, as a fluorescently labelled analogue of endogeneous SM 
18:0 (ref. 9), see Supplementary Information), pentaenoyl-ceramide 18, 
or pentaenoyl-phosphatidylcholine (PC) 18 (Fig. 1c). Distinct FRET 
was observed between the TMD of p24 and SM 18:5, but not with the 
TMD of p23. A mutant lacking the single Trp residue within the TMD 
(p24(TMD)W4A) did not trigger a FRET. With ceramide 18:5 no FRET 
was observed for any TMD. With PC 18:5 weak FRET was observed 
with the p24(TMD), and a slightly stronger signal was obtained with 
p23(TMD). Together these results show a marked specificity of p24 for 


sphingomyelin, with both the hydrophobic moiety and the hydrophilic 
choline phosphate headgroup needed for the interaction. We next 
analysed the extent of FRET obtained between the TMDs of p24 and 
p23 and pentaenoyl-sphingomyelins covering the range of the major 
endogenous molecular species from C14 to C24. A remarkable specificity 
was observed for the interaction of p24 with SM 18 in liposomes com- 
posed of di-oleoyl-phosphatidylcholine/phosphatidylethanolamine/ 
pentaenoyl-sphingomyelin (Fig. 1d, left panel). In liposomes more 
closely reflecting the lipid composition of the mammalian Golgi 
apparatus we again observed a striking specificity for SM 18, with weak 
but significant signals for SM 20 and SM 22 (Fig. 1d, right panel). In 
contrast, the TMD of p23 did not give rise to comparable FRET with any 
of the sphingomyelin species. 

To define the structural prerequisites for SM 18 binding, we per- 
formed an Ala scan across the TMD of p24 starting with position 8 
within the TMD, leaving three amino acids before and after W4 
unchanged, to minimise a direct influence on the biophysical properties 
of the FRET donor. Two groups of mutants were observed, with a 
strong inhibition of FRET with Ala substitutions in positions 8, 10, 
11, 12, 15, 19 and 20 (group 1), and less inhibition in positions 13, 
16, 17, 18 and 21 (group 2) (Supplementary Fig. 4). Although group 1 
mutants showed a strong inhibition of FRET, they maintained a high 
degree of specificity for SM 18:5 (for example, V11A, Supplementary 
Fig. 5a). In contrast, group 2 mutants V13, T16, L17 and Y21 showed 
strongly compromised species specificity (Supplementary Fig. 5c-f). 
Mutant G18A was not further investigated, because G18W had not 
shown an alteration of SM 18 binding (Supplementary Fig. 3d), indi- 
cating that G18 is not directly involved in the lipid—p24 interaction. 
From the above data we deducted that a carboxy-terminal motif made 
of amino acid residues V13, T16, L17 and Y21 of the p24 TMD (V181, 
T184, L185 and Y189 in the full-length protein) represents a structural 
determinant for the specific binding of SM 18 (see also Supplementary 
Discussion). An energy-minimised projection of the TMD yielded the 
structure depicted in Fig. 2a (left). In this model, a groove is formed 
from the Tyr residue in position 21 to the Val residue in position 13. 
Within the sphingomyelin-binding motif, the B-branched residue Ile is 
found in position 20 of the TMD. B-branched residues were shown to 
contribute to TMD-TMD interactions’®”’ by conveying higher rigidity 
and thereby allowing for stronger London dispersion forces. In the 
model, this residue is part of the cavity and thus is likely to contribute 
to the binding motif. If this groove indeed accommodates the backbone 
of SM 18, the insertion of a bulky hydrophobic residue in position 17 
should prevent lipid binding. To fill this cavity with minimal alterations 
of the helical structure’* we introduced a Phe residue in position 17 of 
the TMD. FRET analysis showed that binding of SM 18 was completely 
inhibited (Fig. 2b). To challenge the existence of a C-terminal binding 
motif, we transferred the C-terminal half of the p24 TMD to the amino- 
terminal half of the non-sphingomyelin-binding TMD of p23. The 
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Figure 1 | p24 specifically interacts with SM 18. 
a, CHO cells were grown in the presence of 200 Ci 
of [°H]-photo-sphingosine. Cells were ultraviolet- 
irradiated, lysed and subjected to 
immunoprecipitation using antibodies against p23 
or p24. Radioactivity recovered from input, 
supernatant (SN) and immunoprecipitation (IP) 
was visualized by autoradiography (upper panels). 
Lower panels, western blot analysis. 

b, Quantification of immunoprecipitated 
radioactivity. ¢, In vitro FRET analysis of MBP- 
TMD fusion proteins and pentaenoyl-lipids. 
Proteoliposomes contained either SM 18:5, 
pentaenoyl-ceramide 18:5 (Cer) or PC 18:5. Red 
curve, p23(TMD); black curve, p24(TMD); blue 
curve, p24(TMD)W4A. d, Proteoliposomes were 
prepared in the presence of p24(TMD) (black bars) 
or p23(TMD) (grey bars) and 1 mol% of 


pentaenoyl-sphingomyelins, mimicking a liquid- 
disordered phase (left panel) or a mammalian 
Golgi membrane (right panel). Background- 
subtracted fluorescence data are normalized to SM 
18:5, and are the mean = s.d. of three experiments. 
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Figure 2 | Characterization of the sphingomyelin-binding pocket. 

a, Energy-minimised structure of p24(TMD), p23(TMD), p24(TMD)L17F, 
and p24/p23 chimaeric TMD. Residues of p24 involved in SM 18 recognition 
are depicted in red, the L17F mutation is highlighted in green. 

b, Proteoliposomes containing p24(TMD), p23(TMD), p24(TMD)LI17F or the 
p24(TMD)/p23(TMD) chimaera were prepared in the presence of 1 mol% of 
pentaenoyl-SM (18:5) (di-oleoyl-phosphatidylcholine/ 
phosphatidylethanolamine/pentaenoyl-sphingomyelin, molar ratio 89:10:1). 
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FRET measurements were performed as described above. Data are the 

mean ~ s.d. of three independent experiments. ¢, In vivo binding of [*H]- 
sphingolipids to YFP-p24 and YFP-p24L17F. Upper panel, autoradiography; 
lower panel, western blot. MM, molecular mass marker. d, Molecular dynamics 
simulations: snapshot of a SM 18-p24(TMD) interaction. The interacting lipid 
and residues displaying the signature are highlighted. Blue, p24(TMD)); red, 
sphingomyelin-binding pocket, yellow; SM 18:0 head group; green, SM 18:0 
backbone and N-acylated fatty acid. 
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chimaeric TMD yielded 50% of the level of FRET compared to the 
TMD of p24 (Fig. 2b). Interestingly, the chimaera shows promiscuity 
with regard to the sphingomyelin molecular species, binding significant 
amounts of SM 20 and SM 22 in addition to SM 18 (Supplementary 
Fig. 5b), pointing at a contribution of the N-terminal residues to the 
orientation of the sphingomyelin-binding pocket. The loss of FRET by 
mutant p24(TMD)LI7F suggested a loss of SM 18 binding. To test loss 
of binding in vivo, full-length wild-type p24 and p24L17F were 
expressed as yellow fluorescent protein (YFP) fusion proteins (Sup- 
plementary Fig. 6a). Sphingolipid labelling showed that p24L17F 
has lost the capability to bind sphingomyelin in vivo (Fig. 2c). 
Sphingomyelin is known to be synthesized in the luminal leaflet; 
however, in vitro experiments suggest that sphingolipids can flip 
between the two leaflets stimulated by free ceramide that translocates 
from the cytoplasmic to the luminal leaflet of membranes’ (see also 
Supplementary Fig. 6b). In addition, in vivo analyses using the 
sphingomyelin-binding toxin equinatoxin II point to an occurrence 
of sphingomyelin in the cytoplasmic leaflet of the Golgi membrane". 

To understand binding selectivity at the molecular level, we com- 
pared the structural features of sphingomyelin molecular species (see 
Supplementary Fig. 7). Molecular dynamics simulations and monolayer 
experiments'*” suggest that only the sphingomyelin species with a 
suitable dynamic volume would fit into the cavity of the p24 TMD 
formed by V13, T16 and L17, excluding lipids with larger dynamic 
volume (SM 14 and 16, see Supplementary Fig. 7). Sphingomyelin 
species with smaller dynamic volumes, however, would need to be 
excluded from the hydrophobic cavity by a different mechanism (see 
Supplementary Discussion). 

We next investigated the binding signature through a series of five 
molecular simulations of the TMD of p24 embedded in a POPC bilayer 
including sphingomyelins with fatty acids of C14 to C22. By extending 
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all atomistic simulations to 1 [is we observe spontaneous diffusion of 
sphingomyelins to the TMD and close interaction with the proposed 
binding site primarily for SM 18, and to a minor extent also SM 16 and 
SM 20 (Supplementary Fig. 8). The polar head group of sphingomyelin 
‘wraps around’ the Y21 side chain, while the C18 chain continues 
down around the helix and packs in the groove between V13, T16 
and L17 (Fig. 2d and Supplementary Movie 1). The sphingosine chain 
packs in the groove below V13. The other sphingomyelin molecular 
species also approach the p24 TMD, but steric effects seem to make 
interactions more difficult in these cases. For instance, the SM 14 lipid 
head group rather interacts with Y21 by packing mostly below the side 
chain, which rotates both lipid chains away from the helix and prevents 
efficient packing (Supplementary Movie 2). The lifetime of the SM 18- 
p24 (TMD) complex in the molecular dynamics simulation was of the 
order of 250 ns, which is five times longer than observed for the other 
sphingomyelin species. Likewise, as compared to SM 18, the relative 
dissociation constant of SM 14 was 6.5-fold higher (Supplementary 
Fig. 9). Notably, although the TMD of p24 is highly conserved in 
higher eukaryotes, it is not conserved in yeast, in agreement with the 
absence of sphingomyelin in this organism. 

To search for other candidate sphingolipid-interacting proteins, we 
defined a binding signature where either a §-branched residue (I, T, V) 
or Leu is allowed in any of the first four positions, and an aromatic 
residue (F, W, Y) is allowed in the last position (Fig. 3a). The signature 
represents 4* X 3 = 768 unique sequence motifs, 13 of which were 
found to be overrepresented in a set of mammalian membrane proteins 
(see Full Methods in Supplementary Information). These 13 motifs 
identified 48 candidate proteins (Supplementary Table 1), mostly loca- 
lized to the plasma membrane (Fig. 3b). 

Three recombinant candidates and, as a negative control, the non- 
signature containing asialo-glycoprotein receptor, were expressed in 
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Figure 3 | A conserved sphingolipid binding signature. a, Signature patterns 
used for both randomization/shuffling approach and screening for potential 
sphingolipid-binding proteins (upper panel). All possible combinations were 
analysed for significant overrepresentations in TMDs. Motifs overrepresented 
with a P-value = 0.05 were used to generate the sequence logo (lower panel). 
The letter size corresponds to the probability to find this amino acid at that 
position. Each dot below the line represents an amino acid position. 
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b, Intracellular distribution of signature-containing transmembrane protein 
candidates. c, In vivo labelling of Flag-tagged constructs of human INGR1, 
117RB, B3GT2 and ASGRI. Cells were labelled with (°H]-photo-sphingosine, 
ultraviolet-irradiated, lysed and subjected to IP using an antibody against the 
Flag tag. Radioactivity recovered with protein candidates was visualized by 
autoradiography. The corresponding proteins were detected by western blot 
analysis. Asterisk, expected size of proteins. 
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HeLa cells (Supplementary Fig. 10). In in vivo labelling all three candi- 
dates showed strong binding to a sphingolipid (Fig. 3c). Interestingly, 
INGRI binds to a sphingolipid only upon activation by its ligand 
interferon y. 

What may be the function of the highly specific interaction of SM 18 
with p24? As p24 acts as membrane machinery for the formation of 
COPI vesicles, we analysed if a loss of binding of SM 18 would affect 
transport of biosynthetic cargo'*’. We analysed transport of vesicular 
stomatitis G (VSV-G) protein**”’ to determine the transport rates in 
HeLa cells stably transfected with full-length wild-type p24 or 
p24L17F, N-terminally fused to YFP. Expression of p24L17F caused 
an acceleration of VSV-G protein transport (Fig. 4a), resulting in a 2.5- 
fold increase of the fraction of total VSV-G that reached the plasma 
membrane at t = 45 min (see also Supplementary Fig. 11a, b). 

How can a marked acceleration of transport be caused by a lack of 
binding of SM 18 to p24? A decrease in recycling efficiency would 
cause an increased anterograde transport of this cargo, and should 
affect Golgi export. To test this possibility, we measured Golgi export 
kinetics using fluorescence loss in photobleaching (FLIP). Whereas 
Golgi-associated fluorescence of YFP-wild-type p24 was reduced to 
50% within 4.5 min, the half-life of Golgi residence of YFP-p24L17F 
was significantly prolonged to 7min (Fig. 4b, for controls see 
Supplementary Fig. 11c). To analyse if indeed retrograde transport 
from the Golgi to the ER is affected we monitored trafficking to the 
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Figure 4 | Binding of SM 18 to p24(TMD) affects protein transport and 
triggers p24 dimerization. a, Expression of p24L17F accelerates transport of 
ts-O45-G protein. For each time point and experiment (n = 3), at least 600 cells 
were analysed. CFP-VSV-G, cyan-fluorescent-protein-tagged ts-O45-G 
protein. b, Comparison of the average half lives of decay in fluorescence in 
Golgi of YFP—p24 and YFP-p24L17F. Data represent the mean of n = 22-24 
experiments + s.e. P-value of two-tailed, unpaired t-test < 0.0001 (***) is 
given. c, Expression of YFP-p24L17F reduces toxicity of PE-A. Error bars, 
s.e.m. Statistics: two-tailed, unpaired t-test. d, SM 18:0 induces dimerization of 
p24(TMD) (white bars) but not of p23(TMD) (black bars). TMDs reconstituted 
into proteoliposomes with indicated sphingomyelin species subjected to 
chemical crosslinking (n = 4). Results of paired, two-tailed t-tests are given. 

e, Quantification of homodimers in CHO cells in situ. Error bars, s.e.m. 
Statistics: two-tailed, unpaired t-test. ***P < 0.001 f, Model of p24(TMD) SM 
18 complexes. Left, side view; right, top view. 
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endoplasmic reticulum of Pseudomonas aeruginosa exotoxin A (PE-A). 
The toxin is transported from the Golgi to the endoplasmic reticulum in 
a COPI-dependent manner”. In contrast to cells expressing YFP-p24, 
YFP-p24L17F-cells showed a significant reduction of PE-A-dependent 
inhibition of protein biosynthesis, the readout for exotoxin transport 
to the endoplasmic reticulum (Fig. 4c, for controls see Supplementary 
Fig. 11d) Together, these data imply that efficient retrograde COPI- 
dependent transport depends on binding of p24 to sphingomyelin. 

To test if binding to SM 18 might help organize the dimeric, trans- 
port active state of p24, by affecting its monomer/oligomer equilibrium 
via the TMD, we used a chemical crosslinking assay with proteolipo- 
somes reconstituted from p24 family-TMD fusion proteins and sphin- 
gomyelins of various species compositions. Indeed, dimerization is 
significantly induced only in liposomes containing both SM 18 and 
p24 (Fig. 4d and Supplementary Fig. 12a—c). To analyse if dimerization 
of p24 is also affected in vivo in the presence of full-length p24L17F, we 
monitored dimerization of p24 using an in situ protein-protein inter- 
action assay. Signals were significantly reduced in the presence of 
p24L17F (Fig. 4e, for representative images and western blots see 
Supplementary Fig. 12d). Similar results were obtained when cells 
expressing either YFP—p24 or YFP—p24L17F were subjected to chemical 
crosslinking (Supplementary Fig. 12e). 

How might a complex of dimeric p24 and SM 18 be organized? We 
performed molecular dynamics simulations starting from different 
models based on the sphingomyelin-bound monomeric structures from 
the initial simulations. The most stable model was one with a rather 
polar dimerization interface not overlapping with the sphingomyelin 
binding site (Fig. 4f); this complex remained intact for at least 200 ns. 
Besides a direct role of sphingomyelin in stabilizing a p24 dimer, other 
mechanisms of sphingomyelin-triggered dimerization might apply, 
such as a sphingomyelin-dependent conformational change of p24 or 
build-up of high concentrations of p24 in SM 18 microdomains. 

In summary, we have uncovered an unprecedented specificity of 
interaction between an individual sphingolipid species and a TMD, 
and have defined a structural signature within the TMD for this bind- 
ing. Specific binding of individual lipid species to TMDs of membrane 
proteins may serve different functions and help to understand the need 
for the complexity of membrane lipid compositions at a functional 
level. 


METHODS SUMMARY 

In vivo photoaffinity labelling of CHO cells. Cells were labelled with the differ- 
ent photoactivatable precursors as described’*. Briefly, cells were washed with 
PBS, followed by addition of freshly prepared delipidated medium containing 
lipid precursors. After labelling, the medium was removed and cells were washed 
twice with PBS. After ultraviolet irradiation, cells were collected and lysed in lysis 
buffer (50 mM HEPES-NaOH, pH 7.4, 100mM NaCl, 5mM EDTA, 1% Triton 
X-100 (v/v), 0.5% deoxycholate (w/v) and protease inhibitor cocktail). After lysis, 
samples were subjected to immunoprecipitation. After SDS-PAGE and western 
blotting, radioactively labelled proteins were detected by digital autoradiography 
(f-Imager 2000, Biospace). 

FRET assay. FRET was used to probe the interaction of TMDs with pentaenoyl- 
lipids directly. FRET experiments were conducted on a Jasco 6500 unit spectro- 
fluorometer (Jasco). Proteoliposomes (protein/lipid, 1:5,000) were diluted in a 
quartz cuvette in buffer (10 mM HEPES/KOH pH 7.4) to a final concentration 
of 0.1mM. The proteoliposomes were incubated at 25°C for 5 min under con- 
tinuous stirring. The extent of FRET between TMD and pentaenoyl-lipids was 
determined by recording emission spectra from 310 to 540 nm. Emission spectra 
were collected, exciting at 280 nm the Trp present in the TMD. Slit widths of 5 nm 
were used for both excitation and emission. 

Full Methods and any associated references are available as Supplementary 
information. 
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Structure of the carboxy-terminal region of a KCNH 


channel 


Tinatin I. Brelidze!, Anne E. Carlson!, Banumathi Sankaran? & William N. Zagotta! 


The KCNH family of ion channels, comprising ether-a-go-go 
(EAG), EAG-related gene (ERG), and EAG-like (ELK) K*-channel 
subfamilies, is crucial for repolarization of the cardiac action poten- 
tial’, regulation of neuronal excitability” and proliferation of 
tumour cells*. The carboxy-terminal region of KCNH channels con- 
tains a cyclic-nucleotide-binding homology domain (CNBHD) and 
C-linker that couples the CNBHD to the pore*. The C-linker/ 
CNBHD is essential for proper function and trafficking of ion 
channels in the KCNH family*°. However, despite the importance 
of the C-linker/CNBHD for the function of KCNH channels, the 
structural basis of ion-channel regulation by the C-linker/CNBHD 
is unknown. Here we report the crystal structure of the C-linker/ 
CNBHD of zebrafish ELK channels at 2.2-A resolution. Although 
the overall structure of the C-linker/CNBHD of ELK channels is 
similar to the cyclic-nucleotide-binding domain (CNBD) structure 
of the related hyperpolarization-activated cyclic-nucleotide- 
modulated (HCN) channels’®, there are marked differences. 
Unlike the CNBD of HCN, the CNBHD of ELK displays a negatively 
charged electrostatic profile that explains the lack of binding and 
regulation of KCNH channels by cyclic nucleotides*". Instead of 
cyclic nucleotide, the binding pocket is occupied by a short B-strand. 
Mutations of the B-strand shift the voltage dependence of activation 
to more depolarized voltages, implicating the f-strand as an 
intrinsic ligand for the CNBHD of ELK channels. In both ELK 
and HCN channels the C-linker is the site of virtually all of 
the intersubunit interactions in the C-terminal region. However, 
in the zebrafish ELK structure there is a reorientation in the 
C-linker so that the subunits form dimers instead of tetramers, as 
observed in HCN channels. These results provide a structural 
framework for understanding the regulation of ion channels in 
the KCNH family by the C-linker/CNBHD and may guide the 
design of specific drugs. 

KCNH channels are voltage-gated K* channels that regulate the 
electrical excitability of heart and nerve cells. Similar to other K*- 
selective channels, KCNH channels are composed of four subunits 
surrounding a centrally located pore. Each subunit contains a voltage- 
sensor domain (transmembrane segments $1-S4), and a pore domain 
(transmembrane segments S5-S6 and an intervening pore-forming 
loop)* (Fig. 1a). KCNH channels have a Per-Arnt-Sim (PAS) domain 
in the amino-terminal region and a C-linker and CNBHD in the 
C-terminal region*”* (Fig. 1a). Many of the unique gating properties 
of KCNH channels arise from these intracellular domains®”!*". 
Whereas the structure of the N-terminal region has been solved for 
ERG channels!*!*!’, there is no structural information on the 
C-linker/CNBHD for any of the KCNH channels. 

KCNH channels are part of a large family of cyclic-nucleotide- 
regulated channels that includes the HCN and cyclic-nucleotide-gated 
(CNG) channels (Supplementary Fig. la). Unlike HCN and CNG 
channels'*, KCNH channels are not regulated by direct binding of cyclic 
nucleotides*’'. Instead, it has been suggested that the CNBHD of 
KCNH channels may be an orphan receptor for an as yet unidentified 


channel regulator’’. Indeed, it has been recently shown that mouse 
EAGI channels are regulated by the direct binding of flavonoids to the 
C-linker/CNBHD”’. In addition, the C-linker/CNBHD has been 
shown to interact with the N-terminal PAS domain and regulate 
gating® *. Mutations in both of these regions in human ERG1 channels 
cause long-QT syndrome (LQTS), a heart arrhythmia that can cause 
cardiac death*®. Therefore the C-linker/CNBHD is an important 
regulatory domain in KCNH channels. 

To understand the role of the C-linker/CNBHD in KCNH channel 
function, we sought to determine the X-ray structure of the C-linker/ 
CNBHD ofan ion channel in the KCNH family. Using a screen based 
on fluorescence-detection size-exclusion chromatography (FSEC)”', 


160 mV 


Figure 1 | Topology and electrophysiological properties of zebrafish ELK 
channels. a, Cartoon of two of the four subunits of ELK channels. The pore- 
forming loop and S5-S6 transmembrane domains are grey. The N-terminal 
a-helix and PAS domain are magenta. The ‘elbow and ‘shoulder’ regions of the 
C-linker are represented by the red and pink cylinders, respectively. The «C- 
helix, represented by a cylinder, is green and the rest of the CNBHD is blue. 
b, Currents from zebrafish ELK channels recorded in the inside-out patch 
configuration. 
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we identified the C-linker/CNBHD of Danio rerio (zebrafish) ELK as a 
potential candidate for crystallization. Zebrafish ELK shares substan- 
tial sequence similarity with mammalian ELK channels (Sup- 
plementary Fig. 1c). 

Expression of zebrafish ELK channels in Xenopus laevis oocytes gave 
rise to robust voltage-activated K* currents with electrophysiological 
characteristics similar to the ones reported previously for mammalian 
ELK channels” (Fig. 1b). Zebrafish ELK channels were activated by 
depolarizing voltage steps and showed inactivation at voltages 
>+40mV (Fig. 1b). The half-maximal activation voltage (Vj,)2) was 
—45.3+3.2mV (n=18) with a slope of e-fold per 13.8+0.6mV 
(n = 18) (Supplementary Fig. 1b). In comparison, human ELK2 channels 
activate with Vj. of —22.8 + 0.5 mV andaslope of 18.1 + 0.4mV, and 
inactivate at voltages >+20 mV (ref. 22). As previously observed for 
mouse EAG and human ERG channels*"’, application of cAMP had no 


zELK 


L742 G684 


Figure 2 | Structure of the C-linker/CNBHD. a, Ribbon representation of a 
monomer of the C-linker/CNBHD of zebrafish ELK (zELK) channels. 

b, Alignment of the CNBHD of zebrafish ELK (the «C-helix is green and the 
rest is blue) and mouse HCN2 (grey) channels’®. cAMP in the HCN2 structure 
is yellow. c, d, Electrostatic potential surface of the CNBHD of zebrafish ELK 
(c) and HCN2 channels (d), viewed in the same orientation as in b. e, Residues 
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effect on the currents through zebrafish ELK channels (Supplementary 
Fig. 1b). 

The C-linker/CNBHD of zebrafish ELK channels crystallized in two 
space groups, C222, and P12,1, and diffracted X-rays to 2.2- and 2.3-A 
resolution, respectively (Supplementary Table 1). The structure of the 
zebrafish ELK C-linker/CNBHD was solved by the single-wavelength 
anomalous dispersion (SAD) phasing method using selenomethionine 
derivative crystals. 

The crystal structure revealed that the C-linker of zebrafish ELK 
channels consists of six a-helices (aA’-oF’) with aA’ and «B’ helices 
forming an antiparallel helix—turn-helix motif. The CNBHD consists 
of eight B-strands forming an antiparallel B-roll, three «-helices (A- 
aC), and a short B-strand (9) after the «C-helix (Fig. 2a). The general 
architecture of the CNBHD of ELK is similar to the fold of CNBDs in 
other proteins”®. 


b 


Relative conductance (G/G,,.,,) 


-100 -50 0 50 


Voltage (mV) 
in the B-roll cavity interacting with residues Y740 and L742 of the intrinsic 
ligand. Dashed lines show both polar and non-polar interactions. cAMP from 
the HCN2 structure is shown in yellow. f, Representative conductance-voltage 
relationships for wild-type (black), Y740A mutant (blue) and A740-742 
mutant (green) zebrafish ELK channels. 
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Although the overall folds of the CNBHDs of ELK and HCN2 
channels are similar, superposition of the two structures reveals 
remarkable differences (Fig. 2b and Supplementary Fig. 2a). The root 
mean squared deviation (r.m.s.d.) for the -carbons of the two struc- 
tures (residues 626-740 of zebrafish ELK and residues 523-635 of 
mouse HCN2; ref. 10) is 4.7 A with the largest differences observed 
for the three o-helices in the CNBHD. The «A- and «B-helices of ELK 
are moved away from the cavity formed by the B-roll that serves as the 
cyclic-nucleotide-binding pocket in other CNBD-containing proteins. 
The oA-helix is moved by about 4.5 A and the oB-helix is moved by 
about 3.4A relative to their positions in the cAMP-bound form of 
HCN2. The position of the «A-helix of ELK is similar to the position 
of the xA-helix in cAMP-bound MlotiK channels” and the position of 
the oB-helix is similar to the unliganded MlotiK channels’? 
(Supplementary Fig. 3). The differences in the «C-helix are even more 
marked. The «C-helix of ELK is shorter and bent, and is followed by a 
short B-strand, B9 (Fig. 2b). These differences all reside in the «-helices 
of the CNBD that undergo ligand-dependent conformational changes 
in HCN and CNG channels. 

Out of the ten residues that directly interact with cAMP in HCN2 
channels, only three (V667, L677 and G684) are conserved in zebrafish 
ELK channels (Supplementary Fig. 4). Moreover, most of the residues 
in the phosphate-binding cassette of HCN2, including R591, are not 
conserved, and the «P-helix is missing in zebrafish ELK. The electro- 
static profiles of the CNBHD of ELK and HCN2 channels reveal that 
the putative ligand-binding pocket formed by the B-roll cavity is nega- 
tively charged for ELK but positively charged for HCN2 channels 
(Fig. 2c, d). The negatively charged electrostatic profile would not be 
favourable for binding of a negatively charged cyclic nucleotide. 
Consistent with this, crystallization of zebrafish ELK in the presence 
of 5 mM cAMP did not reveal any new electron density corresponding 
to cAMP (Supplementary Table 1 and data not shown). These differ- 
ences in the CNBD structures probably account, at least in part, for the 
lack of cyclic nucleotide regulation of KCNH channels*". 

The zebrafish ELK CNBHD structure displays another unique 
feature: the 89-strand after the «C-helix forms direct interactions with 
the B-roll cavity in ELK channels (Fig. 2e and Supplementary Figs 2b 
and 5). Interestingly, the phenyl ring of Y740 on the B9-strand is 
positioned in an analogous place to the purine ring of cAMP in 
HCN2 channels and L742 is positioned in an analogous place to the 
cyclic phosphate of cAMP”. Virtually all ion channels in the KCNH 
family have either tyrosine or phenylalanine at the position corres- 
ponding to Y740 in zebrafish ELK and a leucine at position corres- 
ponding to L742 (Supplementary Fig. 1c). These observations raise an 
intriguing possibility that the B9-strand might act as an intrinsic 
ligand, a portion of the protein that occupies the ligand-binding site 
whose displacement regulates the channel. 

To explore a possible regulatory role of the B9-strand, we examined 
the effect of mutations in the B9-strand on the function of intact zebra- 
fish ELK channels. Zebrafish ELK channels with either a point mutation 
Y740A or deletion of the B9-strand (A740-742) exhibited robust 
voltage-activated currents that inactivated at voltages >+40mV, 
similar to wild-type ELK channels (Supplementary Fig. 6a). However, 
for both mutations, the V2 values for activation were significantly 
larger (P< 0.01, Student’s t-test) than the Vj,2 of wild-type channels 
(wild type: —45.3 + 3.2 mV, n = 18; Y740A: —29.8 + 4.4 mV, n = 19; 
A740-742: —28.3 + 4.4 mV, n = 11) (Fig. 2f and Supplementary Fig. 
6b). This 15 mV shift in the voltage dependence of activation in the 
mutants is similar in magnitude to the effect of cAMP on HCN chan- 
nels, lending further support to the possibility that the B9-strand may 
function as an intrinsic ligand for ELK channels. Interestingly, muta- 
tions in the region corresponding to the B9-strand of human ERG 
channels are associated with LQTS”**. Additional experiments will 
need to be done to further test the intrinsic ligand hypothesis. 

Similar to HCN channels, the C-linker of zebrafish ELK channels is 
the primary region of intersubunit interactions in the crystal structure 


532 | NATURE | VOL 481 | 26 JANUARY 2012 


with a buried solvent accessible surface area of about 2,520 A? for each 
subunit (Fig. 3a). The intersubunit interface can be likened to an ‘elbow 
on the shoulder’, where the ‘elbow’ formed by the ~A’- and %B’ -helices 
of one subunit is resting on the ‘shoulder’ formed by the «D’-helix and 
the aC’-aD’ loop of the neighbouring subunit (Figs 2a, 3a, d and 
Supplementary Fig. 7). The structure of the elbow-on-the-shoulder 
interface between neighbouring subunits is very similar between 
ELK and HCN2 channels (Fig. 3d), except that the region before the 
aD’ -helix is not o-helical in ELK. 

Unexpectedly, however, the elbow-on-the-shoulder interface in 
zebrafish ELK occurs with a two-fold related subunit (rotated by 
180°), instead of a four-fold related subunit (rotated by 90°) like in 
HCN channels (Fig. 3a, b). This difference results froma ~55° rotation 
in the region after the «B’-helix in ELK relative to HCN2 channels 
(Fig. 3c). The implication of this marked rotation of the ‘elbow is a 
dimeric assembly of the C-linker/CNBHDs in the zebrafish ELK struc- 
ture (Fig. 3a), as opposed to the tetrameric assembly of the C-linker/ 
CNBDs in HCN channels’® (Fig. 3b). The intersubunit interface and 
the dimeric assembly are preserved in both sets of molecules in the 
asymmetric unit and for both crystal forms of zebrafish ELK 
(Supplementary Fig. 8), suggesting that they are independent of crystal 
contacts. FSEC experiments also revealed that the GFP-tagged 
C-linker/CNBHD of zebrafish ELK channels dimerizes at sufficiently 


Figure 3 | Structural comparison of the C-linker regions and quaternary 
arrangement of the C-linker/CNBHDs of zebrafish ELK and HCN2 
channels. a, A dimer formed by the C-linker/CNBHD of zebrafish ELK (zELK) 
channels viewed perpendicular (left) and parallel (right) to the two-fold axis. 
b, A tetramer formed by the C-linker/CNBD of HCN2 channels viewed 
perpendicular (left) and parallel (right) to the four-fold axis. c, Superposition of 
the ribbon representations of the C-linker/CNBHD monomers of zebrafish 
ELK (yellow) and mouse HCN2 channels (magenta) using the alignment of the 
a-carbons of the CNBDs of these structures. d, Superposition of the elbow-on- 
the-shoulder interface of zebrafish ELK (blue, elbow; yellow, shoulder) and 
mouse HCN2 channels (grey, elbow; magenta, shoulder). 
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high concentrations in solution (Supplementary Fig. 9). Additional 
experiments would be required to determine if the quaternary state 
of the C-linker/CNBHD in intact channels has a two-fold or four-fold 
symmetry (Supplementary Discussion and Supplementary Fig. 10). 

Here we present the first crystal structure, to our knowledge, of the 
C-linker/CNBHD ofa KCNH channel. The structure reveals a putative 
ligand-binding pocket that differs markedly from the cyclic-nucleotide- 
binding pocket of HCN channels and does not appear to bind cyclic 
nucleotides. Instead of a ligand, the putative ligand-binding pocket of 
zebrafish ELK channels is occupied by a novel 9-strand after the «C- 
helix. Mutations of the B9-strand shifted the voltage dependence of 
activation, suggesting that the B9-strand is a regulatory element for 
ELK channels that acts as an intrinsic ligand. The crystal structure also 
displays an unexpected conformation of the C-linker that leads to 
dimerization of the C-linker/CNBHDs of zebrafish ELK channels. 
These findings provide a structural framework to understand the regu- 
lation of KCNH channels by the C-linker/CNBHD. 


METHODS SUMMARY 
Protein purification. The C-linker/CNBHD of zebrafish ELK (amino acids Q543- 
L750) was subcloned into a modified pMalc2T vector (New England Biolabs) 
containing an N-terminal MBP affinity tag followed by a thrombin cleavage site. 
The protein was expressed in BL21 (DE3) Escherichia coli cells as previously 
described", purified on an amylose affinity column and then on an ion-exchange 
column after an overnight cleavage with thrombin at 4 °C. The purified protein was 
concentrated to 20-30 mg ml ' for crystallization. Selenomethionine derivatives 
were generated as previously described’®. 
Crystallography. Crystals were grown at 20°C using the sitting-drop vapour dif- 
fusion method. 150-nl drops of the concentrated protein and reservoir solutions were 
mixed 1:1 by a Mosquito (TTP LABTECH). The final protein solution contained: 
~250mM KCl, 1mM TCEP, 30mM HEPES, 100mM citric acid, pH 3.5. The 
reservoir solution contained: 6% w/v 1,5-diaminopentane dihydrochloride, 
180 mM ammonium acetate, 22.5% (w/v) PEG 3350, 90 mM Tris; pH 8.5 for crystal 
T141; 1.8 M non-detergent sulphobetaine (NDSB)-211, 180 mM ammonium acetate, 
22.5% (w/v) PEG 3350, 90 mM Tris, pH 8.5 for crystal T42; 6% (w/v) D-(+)-galactose, 
180 mM ammonium sulphate, 22.5% (w/v) PEG 3350, 90 mM HEPES, pH 7.5 for 
crystal T26; and 5 mM cAMP, 180 mM ammonium acetate, 22.5% (w/v) PEG 3350, 
90 mM Tris, pH 8.5 for crystal T84. 

The crystallographic data are summarized in Supplementary Table 1 and details 
of the structure determination are described in Methods. 
Electrophysiology. The full-length zebrafish ELK channel (GenInfo Identifier 
(GI) 159570347) with a C-terminal Flag epitope was generated by Bio Basic Inc. 
and subcloned into the pGEMHE oocyte expression vector. Expression of the 
wild-type and mutant zebrafish ELK channels in Xenopus oocytes and current 
recordings were done as previously described’'. Both pipette and bath solutions 
contained 130 mM KCl, 10 mM HEPES, 0.2 mM EDTA, pH 7.2. 1mM cAMP was 
added to the bath solution as indicated. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

FSEC. The C-linker/CNBHD of zebrafish ELK channels (amino acids Q543- 
L750) was covalently fused to a C-terminal GFP in the pCGFP-BC bacterial 
expression vector provided by T. Kawate and E. Gouaux’’. The construct was 
transformed into BL21 (DE3) cells. 5ml cultures of the cells were grown at 
37 °C, induced with IPTG and harvested by centrifugation. The cell pellets were 
resuspended in a lysis buffer (500 mM KCl, 1mM TCEP, 30 mM HEPES, 1 mM 
PMSF and 2.5mgml~' DNase; pH 8.0) and sonicated. Insoluble protein was 
separated by centrifugation and the supernatant was analysed on a Superdex 
200 10/300 GL column (GE Healthcare). 

Scale-up protein purification. The C-linker/CNBHD of zebrafish ELK (amino 
acids Q543-L750) was subcloned into a modified pMALc2T vector (New England 
Biolabs) containing an N-terminal MBP affinity tag followed by a thrombin cleavage 
site. The protein was expressed in BL21 (DE3) Escherichia coli cells as previously 
described''. The cells were harvested by centrifugation, resuspended in a lysis buffer 
(500 mM KCl, 1 mM TCEP, 30 mM HEPES, 1 mM PMSF and 2.5 mg ml | DNase; 
pH 8.0) and lysed in an Emulsiflex-C5 (Avestin). Insoluble protein was separated by 
centrifugation. The C-linker/CNBHD of zebrafish ELK was purified on an amylose 
affinity column and was then loaded on a HiTrap SP FF ion-exchange column 
following an overnight cleavage with thrombin at 4 °C. The protein was eluted with 
a linear KCl gradient and 100 mM citric acid (pH 3.5) was added to the final protein 
solution to increase the solubility of the protein at high concentrations. The protein 
was concentrated to 20-30 mg ml’ for crystallization. Selenomethionine deriva- 
tives were generated as previously described'*”!. 

Crystallization. Crystals were grown at 20°C using the sitting-drop vapour dif- 
fusion method. 150-nl drops of the concentrated protein and reservoir solution 
were mixed 1:1 by a Mosquito (TTP LABTECH). The final protein solution con- 
tained: ~250 mM KCl, 1 mM TCEP, 30 mM HEPES, 100 mM citric acid, pH 3.5. 
The reservoir solution contained: 6% w/v 1,5-diaminopentane dihydrochloride, 
180mM ammonium acetate, 22.5% (w/v) PEG 3350, 90mM Tris; pH 8.5 for 
crystal T141; 1.8 M non-detergent sulphobetaine (NDSB)-211, 180 mM ammonium 
acetate, 22.5% (w/v) PEG 3350, 90 mM Tris, pH 8.5 for crystal T42; 6% (w/v) D-(+)- 
galactose, 180 mM ammonium sulphate, 22.5% (w/v) PEG 3350, 90 mM HEPES, pH 
7.5 for crystal T26; and 5 mM cAMP, 180 mM ammonium acetate, 22.5% (w/v) PEG 
3350, 90mM Tris, pH 8.5 for crystal T84. The crystals were cryoprotected in 
reservoir solution supplemented with 25% glycerol before being flash frozen in 
liquid nitrogen. 

Data collection and structure determination. Diffraction data sets were col- 
lected at the Advanced Light Source (beamline 8.2.1) at Lawrence Berkeley 
National Laboratory in Berkeley, California. Data were analysed with Mosfilm*” 
and HKL2000** software. Molecular replacement using the structure of the 
C-linker/CNBD of HCN2 channels as a search model failed to find a solution. 
Therefore, the structure of the zebrafish ELK C-linker/CNBHD was solved by the 
SAD phasing of selenomethionine derivative crystal T141 using PHENIX”* fol- 
lowed by phase extension to 2.25 A resolution using a native data set. Structures for 
the rest of the data sets were solved by molecular replacement using this structure 
as a search model followed by numerous cycles of refinement in PHENIX and 
manual model building in Coot**. The molecular replacement was carried out 
using Phaser in PHENIX™*. The asymmetric unit contained three molecules in 
the C222, space group and four in the P12;1 space group. The structures of 
different ELK molecules in the asymmetric unit and molecules in the two different 
space groups were very similar, with r.m.s.d. values for the «-carbons calculated for 
the entire sequence of the resolved C-linker/CNBHD ranging from 0.4-0.7 A 
(Supplementary Fig. 8). The crystallographic data and refinement statistics are 
summarized in Supplementary Table 1. Electron density was visible for all but 


several terminal residues in molecules A and B of the native structure in the C222, 
space group. Molecules A and B of the native C222, structure were used for 
analysis in this paper. Analysis with Molprobity”® of the final models indicated 
no Ramachandran outliers for T42 and T26 structures, and 0.59 (%) for T84. 
Figures were made using PyMOL”. The topology of the C-linker/CNBHD of 
zebrafish ELK was defined by PROCHEK (http://www.ebi.ac.uk/pdbsum/). The 
phylogenetic tree in Supplementary Fig. la was computed with Cobalt** (http:// 
www.ncbi.nlm.nih.gov/tools/cobalt/). The GI numbers for the amino acid 
sequences aligned in Supplementary Fig. 1c were: zebrafish ELK, 159570347; 
human ELK1, 27886667; human ELK2, 26006814; human ERG1, 103488986; 
mouse EAGI, 487740; bovine CNGA1, 231739; mouse HCN2, 148699724. The 
electrostatic potential surface calculations were carried out using the APBS” plugin 
for PyYMOL and the PARSE force field, and coloured from red (—3 kT/e) to blue 
(+3 kT/e) where k is Boltzmann’s constant, T is absolute temperature, and e is the 
charge on an electron. 

Electrophysiology. The full-length zebrafish ELK channel (GI: 159570347) with a 
C-terminal Flag epitope was generated by Bio Basic Inc. and subcloned into the 
pGEMHE oocyte expression vector. The cRNA was transcribed using the T7 
mMessage mMachine Ultra kit (Ambion). Expression of the wild-type and mutant 
(Y740A and A740-742) zebrafish ELK channels in Xenopus oocytes and current 
recordings from inside-out patches allowing 10 min for run up following excision 
were done as described before''. Both pipette and bath solutions contained 
130mM KCl, 10mM HEPES, 0.2mM EDTA, pH 7.2. 1mM cAMP was added 
to the bath solution as indicated. Zebrafish ELK currents were elicited by applying 
a series of 100-ms voltage pulses (ranging from —140 to +160mV in 20mV 
increments) from a pre-pulse potential of —140 mV, followed by a 150 ms tail 
pulse to —100 mV. Currents were leak-subtracted with P/4 protocol. To obtain 
conductance versus voltage curves, peak tail current amplitudes at — 100 mV were 
normalized to the largest peak conductance amplitude, which followed a step to 
+40 mV. These normalized data were then plotted against the test voltage, and 
were fit with a Boltzmann equation: 


G 1 
Ginax 1l+e (=) 


Ss 


where V represents the test voltage, V1 2 is the midpoint activation voltage, and s is 
the slope of the relation. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/naturel10789 


Selective killing of cancer cells by a small molecule targeting the stress 


response to ROS 


Lakshmi Raj, Takao Ide, Aditi U. Gurkar, Michael Foley, Monica Schenone, Xiaoyu Li, Nicola J. Tolliday, Todd R. Golub, 
Steven A. Carr, Alykhan F. Shamji, Andrew M. Stern, Anna Mandinova, Stuart L. Schreiber & Sam W. Lee 


Nature 475, 231-234 (2011) 


In this Letter we made errors in the description of the way we per- 
formed tumour measurements in the spontaneous (Fig. 2b and d) and 
the xenograft (Supplementary Fig. 9) mouse models. The legend of 
Fig. 2 incorrectly states that the graph of plotted tumour measurements 
represents tumour diameters from “grossly dissected tumours”. 
Similarly, the Supplementary Fig. 9c legend incorrectly states that 
“After 3 weeks, tumour masses were excised and measured” and the 
legend of Supplementary Fig. 9d incorrectly states that: “After 3 weeks 
treatment, tumour sizes were measured”. Instead, for the experiments 
in both Fig. 2 and Supplementary Fig. 9 we measured diameters of 
tumours in live mice at different time points (as indicated on the x axis 
of the graphs) during the course of the treatment. The tumours were 
measured between the skin surface layers using a caliper. 
Additionally, the legend of Supplementary Fig. 9b and c incorrectly 
refers to “tumour mass(es)” instead of “tumour size(s)”. In the 
MMTV-PyVT spontaneous tumour model (Fig. 2b and d) and in the 
xenograft bladder, breast and lung models (Supplementary Fig. 9b, c 
and d) we measured the length and the width of the tumour. The length 
was measured along the imaginary longitude of the corpse of the mouse 
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Figure 1 | Additional data (volume measurements) for Fig. 2b and d of the 
original paper. Tumour volume measurement in MMTV-PyVT mice. 

a, Tumour volume of single tumours from control (dimethyl sulphoxide, 
DMSO) or piperlongumine (PL)-treated mice were calculated on the basis of 
the length and width of the legions after applying the following formula: 
length X width? X 11/6. b, Similarly, tumour volumes were calculated from 
DMSO-, paclitaxel- or PL-treated mice. Values in bar graphs are 

mean + standard deviation (n = 12). *Shorter treatment period due to high 
toxicity of paclitaxel in animals. 
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and the width was measured in the direction of its latitude. In the 
melanoma xenograft model (Supplementary Fig. 9e) we measured only 
the width owing to the irregular longitudinal shape of the tumours. For 
consistency, the graphs in Fig. 2 and Supplementary Fig. 9 represent 
only the width measurements. 

Furthermore, we defined the error bars in Fig. 2 incorrectly: the 
values in bar graphs in Fig. 2b and d are mean + standard deviation 
(n = 12). We also realize that these error bars were misleadingly small 
when compared to calculated tumour volumes (in cubic centimetres) 
using the following formula: length x width” x 1/6. Tumour volume 
measurements of control and treated single tumours in the MMTV- 
PyVT mouse model are shown below (Fig. 1 of this Corrigendum). 
Although we did observe much higher variability in tumour volumes 
than in tumour width, the magnitude of the difference between 
control and treated animals remains the same. In addition, we made 
errors in indicating the exact number of animals used for each experi- 
ment in Supplementary Fig. 9. The correct numbers are 12 animals 
per group for Supplementary Fig. 9b and seven animals per group for 
Supplementary Fig. 9c, d and e. Values in Supplementary Fig. 9 are 
averages (+standard deviation; n = 12 for Supplementary Fig. 9b, 
n= 7 for Supplementary Fig. 9c, d and e). 

We have also been unable to verify without doubt that the image in 
Supplementary Fig. 9b shows four different mice within the treated 
and untreated groups and therefore wish to replace this figure (see 
Fig. 2 of this Corrigendum). 

We apologize for any confusion these errors may have caused. The 
corrections described here do not alter the overall conclusions of this 
Letter, and all other data still stand. 


PL-treated 


Vehicle-treated 


Figure 2 | This is the corrected Supplementary Fig. 9b. Therapeutic activity 
of PL in xenograft tumour mice bearing EJ bladder cancer cells. Control 
(vehicle)- and PL-treated mice were killed after six weeks of treatment. 
Treatment was 1.5 mgkg ' of PL by intraperitoneal injection daily. 
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TURNING POINT Antarctic researcher finds 
success after break from science p.537 
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Alongside landmarks such as the Forbidden City in Beijing, prospective postdocs will find opportunities in environmental science, palaeontology and other areas. 


The gates are open 


There is a wealth of fellowships and postdoctoral openings in China for foreign researchers 
who aren’t afraid of culture shock. 


BY JANE QIU 


Chinese and still struggled to use chopsticks 

when he first set foot in Qingtongxia, a 
remote village in western China. In 2008, Wick- 
ham, thena PhD student studying insect biology 
at the State University of New York in Syracuse, 
had a fellowship from the East Asia and Pacific 
Summer Institutes (EAPSI), part of the US 
National Science Foundation (NSF). He was 
in China to study the Asian longhorn beetles 
plaguing tens of thousands of hectares of pop- 
lar trees that had been planted to slow desert 
encroachment. The experience gave Wickham 
the chance to network and identify research top- 
ics relevant to both China and the United States. 

His work in the field and his brush with 


Je Wickham knew just a few words of 


Chinese culture made such an impression on 
Wickham that he decided to pursue postdoc- 
toral research in the country. Two years after 
his EAPSI fellowship, he returned to China on 
a three-year postdoctoral grant from the NSF's 
International Research Fellowship Program, 
studying insect species — in particular the 
Japanese pine sawyer beetle — that are pests in 
China and could become invasive if accidentally 
introduced to the United States. He now works 
at the Institute of Chemistry in Beijing, part of 
the Chinese Academy of Sciences (CAS). 


OPPORTUNITIES ABOUND 

Wickham is one of a growing number of 
scientists seeking to jump-start their careers 
in China, attracted by the country’s new 
research infrastructure, expanding scientific 


community and thirst for talent. There were 
127 applicants for the CAS Fellowship for 
Young International Scientists in 2011 — an 
increase of 41% since 2009. About 67% of 
applications are successful. 

China has unique ecological problems that 
make it attractive to environmental scientists, 
plenty of fossils to draw palaeontologists, and 
high investment in clean technology, which 
encourages climate-change research. Post- 
doctoral research in the country, especially in 
these areas, could help young scientists to forge 
collaborations, make contacts and get training 
in specialized techniques. Foreign researchers 
will have to cope with language challenges 
and culture shock, but the experience can be 
enriching both personally and professionally. 

For several years, the Chinese government > 
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Jacob Wickham enjoyed his PhD research in China 
so much that he returned for his postdoc. 


> has been attempting to end the country’s 
brain drain, luring home-grown scientists back 
from overseas and encouraging young talent 
to stay put. More recently, it has also sought 
to draw foreign researchers to its shores. The 
best funding mechanism is usually a fellowship 
(see table), because salaries at Chinese institutes 
are often lower than those in Western coun- 
tries, and Chinese granting mechanisms can't 
easily be applied to foreign scientists. 

CAS and the National Natural Science 
Foundation of China (NSFC) both offer Fel- 
lowships for Young International Scientists 
to freshly graduated PhD holders and more 
experienced postdoctoral researchers of all 
nationalities. Applicants must contact a prin- 
cipal investigator in a Chinese lab, explain their 
project and request that the investigator submit 
an application on their behalf. CAS also runs a 
joint fellowship scheme with the Third World 
Academy of Science (TWAS), to support post- 
docs from other developing countries. 

Foreign agencies run programmes, too. Each 
year, EAPSI provides more than 30 US PhD 
students with first-hand research experience 
in mainland China (it also sends researchers 
to Japan, South Korea, Taiwan, Singapore, 
Australia and New Zealand). Students spend a 
week learning about Chinese language and cul- 
ture in Beijing, before a two-month stint doing 
research at institutes of their choice. They can 
also visit other universities and institutes across 
the country. Options for researchers from out- 
side the United States include the Science and 
Technology Fellowship Programme in China 
(STF China), run by the European Union (EU), 
or initiatives from national funding councils. 

Many foreign postdocs say that working in 
China is far from easy. In daily life, the lan- 
guage barrier is the biggest challenge. “Simple 
tasks, such as opening a bank account and 


going out to do some shopping, are not that 
simple any more,’ says Andy Tsun, a British 
immunologist who has been working at the 
CAS Shanghai Pasteur Institute for two years. 
The EU’s STF China fellows receive six 
months of language and culture training in 
Beijing at the start of their placements, with 
short trips to other parts of China. “It’s a great 
innovation for this kind of programme,’ says 
Sébastian Chanfreau, a French chemical engi- 
neer who spent two years at Nankai University 
in Tianjin. Chinese placements are “certainly 
a daunting experience for people who don't 
speak Chinese and know little about the cul- 
ture and the way of doing things there’, he says. 
However, language is not necessarily a prob- 
lem in the lab or field. Many Chinese research- 
ers speak English well, so foreigners can usually 
communicate with their colleagues. Meetings 
of teams that include international researchers 
are often conducted in English. Of course, more 
complex communications could be tricky — 
foreign postdocs acting as supervisors should 
make sure that instructions are fully under- 
stood, to prevent time or work being lost. 


RED TAPE 

What really baffles most foreign researchers in 
China is how science is done and administrated. 
“Tt definitely takes some time to understand 
how another culture conducts science in aspects 
ranging from grant administration to working 
in the lab and field, and organizing conferences,” 
says Daniel Joswiak, a US glaciologist who has 
been working at the CAS Institute of Tibetan 
Plateau Research in Beijing since 2008. “You 
have to be very proactive.” 

Postdocs often need to take the initiative 
when collaborating or learning about a col- 
league’s work. And researchers may need 
to be forceful to get permits and approvals 
to work in politically sensitive areas such as 
Tibet and Xinjiang. To study pest problems, 
Wickham had to navigate first the State For- 
estry Administration and then the provincial 


FELLOWSHIPS FOR FOREIGNERS 


forestry bureau. Regulations can be vague and 2 
difficult to decipher, and the officials in charge ~ 
may not be in a rush to process the request. It 
is essential to monitor each step of the process. 
“The hierarchical structure can be challeng- 
ing,’ says Wickham. 

Foreign scientists may find it difficult to plan 
field projects ahead of time. In many countries, 
researchers are used to knowing months in 
advance exactly when they are going into the 
field, where and for how long. But often in 
China, “everything is done at the last minute in 
terms of planning field sites and working with 
the local government’, says Wickham. 

In the lab, however, 


“It definitely forward planning is 
takes some time essential: reagents 
to understand often need to be 
how anather shipped from over- 
culture conducts °° Which might 


take more than a 
month. This can be 
frustrating for West- 
ern scientists who are used to next-day deliver- 
ies. “You have to plan ahead,’ says Tsun. “If you 
have a change in project direction, it will take a 
while to get things going” 

There are also restrictive rules for importing 
and exporting cell lines and transgenic animals, 
which can also slow things down and compli- 
cate international collaborations. “You need to 
be able to share reagents in science. It’s also a 
way of validating results,’ says molecular biolo- 
gist Jannie Danielsen, who has been working 
at the CAS Beijing Institute of Genomics since 
early 2011, on a fellowship from the Danish 
Council for Independent Research. 

The Chinese work ethic often makes an 
impression on foreign researchers. “It’s hum- 
bling to see people working so hard,” says 
Tsun. He says that half the Pasteur Institute is 
sometimes still in the lab after 7 or 8 p.m. — 
something that Tsun rarely saw during his PhD 
research at the University of Oxford, UK. 

But hard work doesn't always translate into 


science.” 


Agencies in China and abroad offer funding for young scientists hoping to explore the country. 


Programme Target Duration 
CAS Fellowships for Young Postdocs | 1 year 
nternational Scientists 

SFC Fellowships for Young | Postdocs | 6 months 
nternational Scientists to 1 year 
TWAS-CAS Fellowship Postdocs | 6 months 
Program to 1 year 
EU Science and Technology | Postdocs | 2 years 
Fellowship in China 

SF International Research | Postdocs | 3 years 
Fellowship Program 

SF EAPSI Graduate | 2 months 

students 
ext Step Connections Alllevels | 1 to6 
months 
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Website Notes 

go.nature.com/ | Renewable; open to all foreign 
ectbcu postdocs 

go.nature.com/ | Renewable; open to all foreign 
SUCYXX postdocs 


go.nature.com/ 
u7kuau 


For postdocs from developing 
countries 


go.nature.com/ | For EU citizens 


julvme 


go.nature.com/ | For US citizens or permanent 


ms7yb8 residents 

go.nature.com/ | For US citizens or permanent 
eff32a residents 

go.nature.com/ | Commercial programme 
qrncrb offering professional internships 


in Asia to students and young 
professionals of all nationalities 


creativity. Many of the students “haven't been 
trained so much in using their knowledge 
to generate new ideas and find new solu- 
tions’, says Danielsen. “They work extremely 
hard and very long hours, but I am not sure 
whether they are able to step back a bit and 
reflect on the results” Wickham says that the 
science is often highly managed by profes- 
sors, and researchers are not encouraged to 
take risks or learn from their mistakes. 


A DIFFERENT WORLD 

Foreigners may find that some practices 
are anathema to their usual customs. Most 
academic institutions in China offer finan- 
cial rewards for getting papers published in 
journals with high impact factors — often 
thousands of dollars for the first and cor- 
responding authors (see Nature 441, 792; 
2006). Such policies threaten to make com- 
petition unhealthy and discourage people 
from working together and exchanging 
ideas, says Sarah Rothenberg, an environ- 
mental scientist who has just joined the 
University of South Carolina in Columbia 
after three years at the CAS Institute of Geo- 
chemistry in Guiyang. “Science has become 
a totally different game there,” she says. 

Despite the differences, all the research- 
ers contacted for this article say that their 
Chinese colleagues went out of their way 
to make them feel welcome and help them 
to sort out logistical issues such as hous- 
ing. The country’s ample research funding 
also helps. “There are problems with money 
everywhere but China,” says Chanfreau. 
“I was often told, ‘Money is not a problem, 
just get what you need’.” 

Some foreign postdocs have taken the 
opportunity to experience Chinese cul- 
ture outside their research. Chanfreau, for 
instance, helped to initiate an EU-China 
science-communication project in Beijing, 
akin to the international Café Scientifique, 
in which researchers and the public meet to 
debate topical issues such as green chemis- 
try, breast cancer and genetic engineering. 
“The sense of being able to contribute to the 
public understanding of science in China is 
extremely rewarding,’ says Chanfreau. 

The postdocs who find success will be 
those who are open to a new environment 
and eager to explore different approaches 
to science. “A sense of humour and the 
willingness to be flexible are crucial,” says 
Corwin Sullivan, a palaeontologist who 
went to China to pursue a postdoc at the 
CAS Institute of Vertebrate Paleontology 
and Paleoanthropology in Beijing in 2005 
and is now an associate professor there. 
“The expats who find China most difficult 
are those who have a rigid sense of how 
things should work and refuse to adapt: = 


Jane Qiu is a freelance writer based in 
Beijing. 


TURNING POINT 
Rob McKay 


Rob McKay, a glacial sedimentologist at the 
Victoria University of Wellington in New 
Zealand, won the 2011 Prime Minister's 
MacDiarmid Emerging Scientist prize in 
December for research on Antarctica’ climate 
and environmental history. 


Why did you leave science after completing a 
master’s degree in geology? 

I did my master’s with Peter Barrett at Victoria 
University. I went on my first trip to Antarctica 
with him, and I really enjoyed being in the field. 
But after I finished in 2000, I had hefty student 
loans and decided to make some money and see 
the world. I ended up in the United Kingdom, 
using the critical-thinking skills honed during 
my graduate research to edit research reports at 
an investment company. 


What lured you back to do a PhD? 

Thad kept in touch with Peter, intending to write 
my master’s results into a paper. But in 2005, he 
invited meto do PhD research with the Antarc- 
tic Geological Drilling project (ANDRILL), a 
multinational collaboration to investigate past 
climate change. I had concerns about doing a 
PhD at the same university that I earned my ear- 
lier degrees from. But the geology department 
had expanded to run the Antarctic Research 
Centre and had a greater focus on international 
collaboration. I decided that pursuing a PhD 
there would be a good career move. 


Describe your role in the ANDRILL research. 

I was meant to focus on sedimentary petrol- 
ogy, working out the origin of sediments in 
1-metre cores from the Antarctic ice shelf. But 
ANDRILL found evidence of past cycles of ice- 
sheet expansion and retraction coming out of 
this one-of-a-kind 1.3-kilometre drill hole. We 
found 60 cycles of an alternating pattern. Doc- 
umenting and interpreting these cycles became 
one of the more important parts of the project, 
and I worked on that. Our team’s findings con- 
firmed that the ice sheet was highly variable, 
which had been the subject of speculation. 


What did you learn from that experience? 

You have to grasp the opportunities presented 
to you, even if they are high-risk, high-reward. 
Still, I realized that I should have a back-up 
plan. Luckily, the 1.3-kilometre drill hole was 
that plan, and it proved to be quite fruitful. 


How have international collaborations helped 
your career? 

Ive spent 2-3 months at a time on expedi- 
tions with people from the United States, 


CAREERS 


Europe and Japan. These trips are bonding 
experiences, which I think facilitates contin- 
ued collaborations — invitations to labs all 
over the world. Big interdisciplinary projects 
bring together hugely diverse groups of peo- 
ple. My work on ANDRILL led to a paper in 
the Geological Society of America Bulletin (R. 
McKay et al. Geol. Soc. Am. Bull. 21, 1537- 
1561; 2009), which received a lot of attention 
from the palaeoclimate community because 
it was one of the most detailed records yet 
published of Antarctic climate history and 
fluctuations in the ice sheet. That led to an 
invitation to take part in the Integrated Ocean 
Drilling Program, an international marine- 
research initiative. I’m taking my palaeo- 
climate work from the ice sheet to the ocean, 
trying to determine how ocean chemistry 
dynamics affect climate. 


How will the Emerging Science prize affect 

your career? 

It certainly will help with my imminent job 
search. It’s prestigious and helps to bring atten- 
tion to climate research. I hope to use some of 
the NZ$200,000 (US$160,000) prize money to 
get another PhD student involved in my next 
project, which is likely to involve sea-level 
histories around New Zealand, and to fund 
collaborative research with an overseas lab. 


To what do you attribute your success? 

There is an element of luck. I was fortunate 
to work on this amazing drill hole that had 
features that had never been seen before. ’m 
also really interested in the research, and have 
learned to overcome intimidation to put my 
ideas out there. = 


INTERVIEW BY VIRGINIA GEWIN 


26 JANUARY 2012 | VOL 481 | NATURE | 537 


© 2012 Macmillan Publishers Limited. All rights reserved 


Ua SCIENCE FICTION 


ROCK ‘N’ ROLL ALIENS 


BY RALPH GRECO JR 


and catch Dr Malacuplus before he 

made off with the orb. The Latex Kids 
were blistering through their latest seven- 
minute epic guitar assault, and Sinthia 
Slide, the band’s lead singer and my third 
cousin once removed, was adopting her 
usual stage pose, channelling one-quar- 
ter Janis Joplin, one-quarter Rip Taylor, 
another part classic Medusa and 
one-quarter that tiger who had 
had enough with Siegfried and 
Roy. As she scared/delighted/ 
aroused the crowd, I realized 
that my only chance was to use 
her in a way that would’ve had 
her snapping her whip at me if I 
hadn't been the band’s security, 
confidant and benefactor. As 
Sinthia set her hips, spread her 
thighs (as best she could in what 
she wore) and unfurled her red 
latex dress down to her knees, 
she created for me a tighter- 
than-tight trampoline. I leapt 
from my perch atop the wall of 
Marshall stacks, bounced off the 
front of Sinthia’s dress and cata- 
pulted over the bands’ collective 
heads straight on to the back of 
that bald, smelly doctor. 

Of course, most of the audi- 
ence at the Steam Works, an 
outpost of 7th Avenue and 
nowheresville New Jersey, 
thought my aerobatics were part 
of the show. The LKs truly deliver 
for their fans. That a six-foot guy 
with a crew cut, jack-boots and 
split rivet jeans could come tumbling down 
from some amps, bounce off the petite lead 
singer as she held her dress out, shoot across 
the stage and land on the arch-nemesis of 
our entire lifestyle community — the doctor 
liked to steal much more than just off-world 
glowing orbs of power — prompted a five- 
minute round of applause and not one raised 
eyebrow. All of which went far in bolstering 
my good feelings for the people who, like me, 
are into latex, leather and lace. 

“You said he was bound to show before the 
end of the season,’ Sinthia said as the band 
and I congregated backstage, sipping a round 
of cold ones. 

“T hoped putting the orb out there in full 
view would pull him out.” 


I had just seconds to leap across the stage 


The looks and the lifestyle. 


“Amazing half-gainer off my pelvis, 
cousin,’ she added. 

The other three band members — drum- 
mer Trudy Sticks, her bassist brother 
Randee (no last name), and the truly 
gargantuan, gaunt and to all intents and 
purposes mute (he truly let his instrument 
speak for him most of the time) lead guitar- 
ist Larry Candy Cane — and I gazed at the 
lead singer as she made her way across the 
room in her six-inch-heel patent-leather 


boots. Being together two years, sharing 
dressing rooms and toilets as we had, and 
seeing one another in every which way dur- 
ing our rise to the toppermost-poppermost 
in this wacky modern music biz, Sinthia’s 
body was still a wonder to any of us who 
spied it bare. But to me what was even more 
amazing, and a testament to what I really 
was all about, was that I stared at her boots 

more than her butt. 
The band had a third set yet to deliver, 
and as the unbuckling, drying off and re- 
lubing process for 


> NATURE.COM their next round of 
FollowFutureson — rock fetish-wear began 
Facebook at: (though The Cane, as 
go.nature.com/mtoodm © we called him, kept his 
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red leather vest and jodhpurs on), I knew 
I could relax a bit. I had dispatched the evil 
doctor, secreted him in the back of my van 
after a quick knockout punch, and I once 
more had the orb safely tucked under my 
long trench. The members of LK were never 
as comfortable staring at the thing as they 
were Sinthia’s butt — seeing as the orb was 
not of this world. 

I was picked as the orb’s keeper because 
the aliens who made it saw I ran with a 
community. Our goth-that-was- 
fetish-that-was-rock-and-roll 
crowd was not easily unnerved 
and was one they wanted to 
infiltrate before they made their 
presence known further to us 
humans. The charge for the pow- 
ers the orb gave me — a mere 
human — was a fee I was collect- 
ing as I bent to grab the band’s 
sweaty and slightly worn stage 
clothes. Our alien friends wanted 
as many outfits made of weird 
Earth materials as possible — 
my part-time job when not with 
LK was unloading shipments at 
The Gap — and I shipped what I 
gathered, as I would the evil doc- 
tor, off Earth via the wonderful 
transportation device I had hid- 
den in my loft bathtub. I never 
asked for what purpose the scaly 
little creatures with the red eyes 
used the band’s old sweaty latex, 
bent bruised buckles and leather, 
but as I had been promised that 
the orb would envelop us allina 
rather splendiferous, fortunate 
glow — as it truly seemed to have, 
as far as this band was concerned 
— LK’s ever-increasing wardrobe allowance 
helped us all along nicely. 

I left the dressing room and pulled my 
trench tight round my bare chest, content 
in the knowledge that I had once again rid 
the Universe of another bad character — and 
was collecting some smelly latex and leather 
for a race more enlightened than our own. = 


Ralph Greco Jr is an internationally 
published author of short stories, plays, 
essays, 800 sex ‘scripts’, children’s songs and 
SEO copy. Ralph is also an ASCAP licensed 
songwriter/performer and Internet radio DJ. 
Living in the wilds of suburban New Jersey, 
USA, Ralph attempts, yet fails, to keep his 
ever-expanding ego in check. 
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Adenylylation control by intra- or intermolecular 
active-site obstruction in Fic proteins 


Philipp Engel'+*, Arnaud Goepfert'*, Frédéric V. Stanger! 
& Christoph Dehio! 


Fic proteins that are defined by the ubiquitous FIC (filamentation 
induced by cyclic AMP) domain are known to catalyse adenylyla- 
tion (also called AMPylation); that is, the transfer of AMP onto a 
target protein. In mammalian cells, adenylylation of small 
GTPases through Fic proteins injected by pathogenic bacteria 
can cause collapse of the actin cytoskeleton and cell death’. It is 
unknown how this potentially deleterious adenylylation activity is 
regulated in the widespread Fic proteins that are found in all 
domains of life and that are thought to have critical roles in 
intrinsic signalling processes. Here we show that FIC-domain- 
mediated adenylylation is controlled by a conserved mechanism 
of ATP-binding-site obstruction that involves an inhibitory 
a-helix (G;,,) with a conserved (S/T)XXXE(G/N) motif, and that 
in this mechanism the invariable glutamate competes with ATP 
y-phosphate binding. Consistent with this, FIC-domain-mediated 
growth arrest of bacteria by the VbhT toxin of Bartonella schoen- 
buchensis is intermolecularly repressed by the VbhA antitoxin 
through tight binding of its ;,;, to the FIC domain of VbhT, as 
shown by structure and function analysis. Furthermore, structural 
comparisons with other bacterial Fic proteins, such as Fic of 
Neisseria meningitidis and of Shewanella oneidensis, show that Oink 
frequently constitutes an amino-terminal or carboxy-terminal 
extension to the FIC domain, respectively, partially obstructing 
the ATP binding site in an intramolecular manner. After mutation 
of the inhibitory motif in various Fic proteins, including the human 
homologue FICD (also known as HYPE), adenylylation activity is 
considerably boosted, consistent with the anticipated relief of inhibi- 
tion. Structural homology modelling of all annotated Fic proteins 
indicates that inhibition by @;,), is universal and conserved through 
evolution, as the inhibitory motif is present in ~90% of all putatively 
adenylylation-active FIC domains, including examples from all 
domains of life and from viruses. Future studies should reveal how 
intrinsic or extrinsic factors modulate adenylylation activity by 
weakening the interaction of G1, with the FIC active site. 

In two Fic proteins, IbpA and VopS, that are translocated by patho- 
genic bacteria into host cells, the ubiquitous FIC domain has been 
shown to catalyse adenylylation’~. The crystal structure of the effector 
domain IbpA(FIC2) in complex with its adenylylated host target Cdc42 
has been reported? and a catalytic mechanism has been proposed”®. 
IbpA- or VopS-mediated adenylylation of Rho-family GTPases 
abolishes downstream signalling in human cells and, thus, causes actin 
cytoskeleton collapse and cell death'”. By contrast, overexpression of a 
human Fic protein with similar target specificity, HYPE, had only a 
marginal effect’. This suggests that the potentially deleterious adenyly- 
lation activity is tightly regulated in HYPE and probably in most of the 
almost 3,000 Fic proteins that are proposed to have important roles in 
intrinsic signalling processes in bacteria, archaea and eukaryotes. 

VbhT is a bacterial Fic protein of the mammalian pathogen 
B. schoenbuchensis’*. It is composed of an N-terminal FIC domain 


, Alexander Harms!, Alexander Schmidt’, Tilman Schirmer? 


and a C-terminal BID domain (Fig. 1a). The BID domain facilitates 
protein translocation into mammalian or bacterial target cells through 
a type IV secretion system or conjugation machinery’, respectively, but 
the target cell and functional role of VbhT are unknown. VbhT arrests 
growth when expressed in Escherichia coli (Fig. 1c and Supplementary 
Fig. 1a). Growth arrest is repressed by mutation of histidine to alanine 
(VbhT(H136A)) in the conserved FIC motif HXFX(D/E)GNGRXXR. 
In other Fic proteins, this signature motif has been shown to be essential 
for target protein adenylylation activity’*"°, therefore suggesting that 
toxicity is related to adenylylation of endogenous proteins. Indeed, 
wild-type VbhT, but not VbhT(H136A), catalysed in vitro adenylyla- 
tion of a putative E. coli target protein of approximately 80 kilodaltons 
(kDa) (Fig. 1d). Furthermore, E. coli cells showed filamentation after 
expression of wild-type VbhT, but not of VbhT(H136A) (Supplemen- 
tary Fig. 2). A similar phenotype has been described for what is thought 
to be a hyperactive mutant of the E. coli Fic protein''. Co-expression of 
VbhT with VbhA, encoded by the small open reading frame vbhA 
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Figure 1 | The small protein VbhA represses the toxic effect (growth arrest) 
that is mediated by the adenylylation activity of VbhT in E. coli. a, Genetic 
organization of the type-IV-secretion-system-associated locus of B. 
schoenbuchensis, which is composed of the overlapping vbhA and vbhT genes. 
The FIC and BID domains that are encoded by vbhT are shown in different 
colours. Protein translations of the regions encoding the conserved motif of 
VbhA and the FIC motif of VbhT are depicted. b, Protein alignment of VbhA 
and a subset of the 158 homologues that are encoded upstream of fic loci in 
different bacteria (see also Supplementary Fig. 4). The most conserved region 
shows a (S/T)XX(I/L)EG consensus. Sequence accessions and strain 
designations are given in Supplementary Fig. 4. c, Growth of E. coli on lysogeny 
broth (LB) plates after IPTG-induced expression of different VbhT and VbhA 
constructs. Expression of VbhT shows a toxic effect, whereas bacterial growth is 
not affected when VbhT(H136A) is expressed or when VbhT and VbhA are co- 
expressed (VbhT-VbhA). All E. coli strains revealed normal growth without 
induction. Growth curve experiments in LB gave analogous results 
(Supplementary Fig. 1). d, Adenylylation assay with crude cell lysates of E. coli 
ectopically expressing the same constructs as in panel c. 
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immediately upstream of vbhT (Fig. 1a), completely repressed VbhT 
toxicity, as shown by wild-type-like bacterial growth, normal cell 
morphology, and inhibition of VbhT-dependent adenylylation (Fig. 1 
and Supplementary Figs 1a and 2). We also observed VbhT-mediated 
toxicity and its repression by VbhA in B. schoenbuchensis, the natural 
carrier of this toxin and antitoxin, and in the related species Bartonella 
henselae (Supplementary Fig. 3). 

The inhibitory action of VbhA on the VbhT toxin, and the genetic 
organization of the respective genes in an operon are reminiscent of 
toxin-antitoxin modules that are found in many bacterial genomes, 
often associated with mobile genetic elements’*. A comprehensive ana- 
lysis of the upstream region of FIC-domain-encoding genes (PFAM 
pf02661) identified 158 bacterial vbhA homologues that probably 
function as antitoxins. Although the sequences are rather diverse, a 
central (S/T)XX(I/L)EG motif is conspicuous (Fig. 1b and Sup- 
plementary Fig. 4). The high-resolution (1.5 A) crystal structure of 
VbhA in complex with the FIC domain of VbhT (VbhT(FIC)) (Sup- 
plementary Table 1) shows that VbhA is folded into three anti-parallel 
helices that tightly embrace VbhT(FIC) (Fig. 2a) with the N-terminal 
helix (inh), adopting a location that is analogous but distinct to that of 
the antitoxin Phd in its complex with Doc’* (Supplementary Fig. 5). 
Doc is a Fic protein with a degenerate, probably adenylylation- 
incompetent FIC motif" that may have adopted another toxic activity 
(Supplementary Information, section 1). The VbhA antitoxin motif 
locates to the C-terminal part of &,, and is positioned close to the 
putative ATP-binding site'® at the N-cap of the helix that follows the 
active loop of VbhT(FIC). This suggests that the antitoxin competes 
with ATP binding. VbhA residues Ser 20 and Glu 24 of the inhibitory 
motif form a hydrogen bond and a salt bridge, respectively, with the 
conserved Argl47 of VbhT following the active loop. 
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Figure 2 | Structures and classification of Fic proteins according to the 
position of the inhibitory motif along the polypeptide chain. Structures are 
shown in cartoon (FIC core as defined by PFAM, red; active site loop with 
histidine, yellow; inhibitory helix (o,,) with SXXXE(G/N) motif, green) with 
important residues in full. a, Complex of VbhT(FIC) with antitoxin VbhA 
(green). b, SoFic (from S. oneidensis, PDB code 3EQX) with C-terminal domain 
omitted. c, NmFic (from N. meningitidis, PDB code 2G03). d, Linear 
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i Two classes 


Intriguingly, structural comparison with other bacterial Fic proteins 
of known fold (Fic proteins from S. oneidensis (SoFic)'*"® and from 
N. meningitidis (NmFic)* (Fig. 2b, c), and from Bacteroides theataio- 
taomicron (BtFic)'* and from Helicobacter pylori (HpFic)'* (Sup- 
plementary Fig. 6)) reveals that a structural equivalent of oy, can be 
part of the FIC domain fold itself. Moreover, these proteins also show the 
SXXXE(G/N) inhibitory motifs that are, with respect to the FIC active 
site, arranged exactly as in the VbhA-VbhT(FIC) complex. Along the 
polypeptide chain, however, these 4;,,, occur at two distinct locations 
either in the N-terminal part (SoFic and BtFic) or at the C terminus 
(NmFic and HpFic). Thus, Fic proteins containing ;,,, can be grouped 
into three classes (Fig. 2d) depending on whether 0;,,, is provided by an 
interacting antitoxin (class I) or whether it is part of the FIC fold as an 
N-terminal helix (class IT) or a C-terminal helix (class II). 

To investigate the distribution of class II and class III Fic proteins, we 
predicted the structures of all PFAM FIC domain entries by homology 
modelling (Supplementary Information, section 1). Including the class I 
proteins that are identified above, two-thirds of all Fic proteins were 
classified (Fig. 2e and Supplementary Table 2), with a strong dominance 
of class II, the class to which human HYPE belongs (Supplementary 
Tables 3 and 4). The proportion of classified Fic proteins increases to 
90% when considering only adenylylation-competent Fic proteins that 
are defined by compliance with the HXFX(D/E)GNGRXXR motif 
(Supplementary Text). This suggests co-evolution of catalytic and 
inhibitory function. The inhibition motifs that are derived from Psi- 
Blast (class I) and structural predictions (classes II and III) are shown in 
Fig. 2f with the overall consensus being (S/T)XXXE(G/N). The strict 
conservation of the glutamate is striking and indicates that the observed 
ionic interaction with the second arginine of the conserved FIC motif 
(Figs 2a—c) is crucial for inhibition. Phylogenetic distribution of class I 
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and the subset of Fic proteins with the conserved FIC motif HXFX[(D/ 
E)GNGRXXR that is likely to convey adenylylation activity (right; see also 
Supplementary Information, section 1). f, Sequence profiles for the inhibition 
site of the three Fic protein classes. 
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and class III Fic proteins indicates that each is of monophyletic origin 
(Supplementary Fig. 7). Fic proteins with a degenerate FIC motif are 
dispersed over the tree, with the exception of the large cluster of Doc- 
like toxins. This suggests that there is recurrent degeneration of the 
conserved FIC motif with concomitant loss of adenylylation activity. 
Consistent with this, deterioration of the FIC motif seems to correlate 
with the absence of a recognizable inhibitory motif (Supplementary 
Fig. 7). Fic proteins with a degenerate FIC motif may display catalytic 
activities different from adenylylation, such as phosphocholination, as 
reported for the Legionella pneumophila effector AnkX"*. 

Owing to our discovery of the prevalence of the inhibitory motif in 
Fic proteins, we carried out a detailed analysis of its functional and 
structural role. For this, NmFic (class III, Fig. 2c) was chosen, as it is the 
smallest active Fic family protein with known crystallization condition 
(Supplementary Table 1). As reported before®, NmFic exhibits in vitro 
auto-adenylylation activity (Fig. 3d). The acceptor site was traced to 
Y183 of On, by mass spectrometry (Supplementary Information, 
section 1, and Supplementary Fig. 8). On the basis of the location of 
Y183 relative to the active site, auto-adenylylation is probably cata- 
lysed intermolecularly after partial unfolding or detachment of Onn. 
Addition of E. coli lysate to NmFic did not reveal additional bands on 
the autoradiograph, and this indicated that there are no NmFic targets 
in E. coli or that the activity of NmFic is inhibited. The latter was shown 
to be correct as mutation of the inhibitory motif (S182A/E186A, 
NmFic(SE/AA)) resulted in transfer of radioactivity onto an ~80-kDa 
E. coli protein and enhanced auto-adenylylation with an additional 
acceptor site (Y188; Fig. 3d and Supplementary Fig. 9). Deletion of the 
entire Onn helix (NmFic(A8)) led to similar target protein adenylylation, 
proving that the activity resides in the FIC domain core. However, only 
weak auto-adenylylation was apparent owing to the lack of the acceptor 
tyrosines in this deletion mutant (Fig. 3d and Supplementary Fig. 8c). 

To investigate the inhibitory mechanism, crystals of NmFic proteins 
were soaked with the non-hydrolysable ATP analogue adenylyl 


Figure 3 | Structure and function of wild-type and mutant NmFic reveals a 
general inhibition mechanism corroborated by HYPE protein analysis. 

a, Active site of NmFic with bound ATP analogue AMPPNP. The y-phosphate 
seems to be disordered and has not been modelled. b, Active site of NmFic(A8) 
with bound AMPPNP and Mg” *. The y-phosphate occupies the position taken 
by E186 of the SXXXEG motif in the wild-type (as shown in a). Also shown are 
2Fo — Fc maps that are contoured at 1.20 and cover the ligands. c, Surface 
representation of the active site of NmFic(A8) with modifiable Y32 (yellow) of 
the Cdc42 switch1 loop (green) in a position that is obtained from a 
superposition of the active site loops of NmFic over the IbpA(FIC2)—Cdc42 
complex’. d, Autoradiography of an SDS gel after incubation of wild-type and 
mutant NmFic with [«-*?P]ATP in the presence or absence of E. coli lysate. The 
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imidodiphosphate (AMPPNP) (Supplementary Table 1). The NmFic- 
AMPPNP structure revealed nucleotide binding but with the y- 
phosphate disordered (Fig. 3a). Notably, the orientation of the o- 
phosphate seems to be non-productive, as the position that is in line 
with the scissile Pa-O3« bond is occluded by H107 and N113 (Sup- 
plementary Fig. 10a and Supplementary Movie). To reveal the situ- 
ation in an inhibition-relieved mutant, the structure of NmFic(SE/AA) 
was determined to 3.0 A (Supplementary Fig. 11). Electron density was 
lacking for on, indicating disorder, whereas the nucleotide conforma- 
tion was well defined. In the NmFic(A8)-AMPPNP structure, the 
same nucleotide conformation was observed (Fig. 3b) and, owing to 
its high resolution (1.7 A), the structural basis for the observed relief of 
inhibition in these mutants became evident. Whereas the adenosine 
moiety adopts the same position as in the wild-type, the y-phosphate 
of the nucleotide is bound to R118, occupying the same position as the 
carboxylate of the inhibitory E186 in the wild type. As a consequence, 
the «-phosphate is found to be re-oriented, and the new orientation 
permits in line attack of a target side chain onto the «-phosphorus to 
accomplish AMP transfer (Fig. 3c, Supplementary Fig. 10 and Sup- 
plementary Movie). 

The exact role of the inhibitory glutamate was investigated further 
by mutagenesis of Fic proteins from the three regulatory classes. In 
wild-type NmFic, Co and CB of the glutamate are close to the position 
that is attained by the y-phosphate position in NmFic(A8) (Fig. 3c). 
Still, an E to G single point mutant may provide sufficient space and 
main-chain flexibility to allow y-phosphate binding. Indeed, similar to 
NmFic(A8)and NmFic(SE/AA), the mutant NmFic(E186G) resulted 
in growth inhibition of E. coli (Supplementary Fig. 1b). Likewise, co- 
expression of VbhT(FIC) and VbhA(E24G) as representatives for 
class I, caused E. coli growth defects (Supplementary Fig. 1a). We also 
included SoFic, a bacterial class II protein, in this analysis. Consistent 
with the effects of E to G single point mutants in NmFic and VbhA, 
mutant SoFic(E73G) revealed a negative effect on E. coli growth 
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(Supplementary Fig. 1c) and strongly enhanced auto-adenylylation 
(Supplementary Fig. 12). A similar marked effect on auto-adenylyla- 
tion was observed after mutation of the corresponding residue in 
human HYPE (HYPE(FIC/E234G)), another class II protein (Fig. 3f, 
g). It is known that HYPE catalyses Cdc42 and Racl adenylylation’’. 
The rather low adenylylation activity of wild-type HYPE on these 
substrates is markedly enhanced in the HYPE(FIC/E234G) mutant 
(Fig. 3f, g), demonstrating that the relief of inhibition by %;,, enhances 
not only auto-adenylylation but also AMP transfer onto bona fide 
protein targets. 

From this study a general mechanism for the inhibition of the FIC- 
domain-mediated adenylylation has emerged that invokes a glutamate 
finger from On, to inter- or intramolecularly block part of the ATP 
binding site (Fig. 3e). Dissociation of the toxin-antitoxin complex 
(class I) or intramolecular loosening of the contact between o,, and 
the FIC domain active site (classes II and III) allows ordered binding of 
the entire ATP moiety with the «-phosphate in an orientation that is 
productive for accepting an incoming target hydroxyl side chain. 

Class I proteins might exert functions similar to the classical 
bacterial toxin-antitoxin modules’, whereas class II and III Fic 
proteins seem to regulate intrinsic cellular functions that are related 
to physiological adaptation and cell homeostasis. Conservation of class 
II Fic proteins in all three kingdoms of life (Supplementary Table 2) 
emphasizes the important role of the regulatory mechanisms described 
here. How the adenylylation activity is activated by weakening the 
interaction of o,,, with the FIC active site in class II and class III Fic 
proteins is one of many important questions for future research. We 
also anticipate our study to be a starting point for rational approaches 
to modulate the adenylylation activities of Fic proteins—approaches 
that should aid in elucidating the diverse biological functions of these 
widespread signalling proteins. 


METHODS SUMMARY 


Toxicity experiments were performed with E. coli strain MG1655, which encodes 
an IPTG (isopropyl-f-p-thiogalactoside)-inducible T7 polymerase (strain 
AB472). Protein expression was controlled by the addition of either 100 1M 
IPTG (induction) or 1% glucose (repression). A PSI-BLAST search for homolo- 
gues with VbhA of B. schoenbuchensis was conducted to identify Fic proteins 
belonging to class I. Class II and class III proteins were classified by structure 
predictions using the program HHpred’’. We analysed all 2,189 proteins of the 
FIC PFAM family (pf02661, release 24). Adenylylation assays were carried out 
using bacterial crude cell lysates for VbhT constructs or purified proteins for 
NmbFic, SoFic and HYPE constructs. For structure determination, VbhA and 
VbhT(FIC) were co-expressed in the BL21(DE3) strain of E. coli, and NmFic, 
SoFic(A8) and NmFic(SE/AA) in the BL21-AI strain of E. coli. VbhA- 
VbhT(FIC) and NmFic were purified by affinity chromatography followed by size 
exclusion chromatography. An additional anion exchange chromatography step 
was performed for NmFic(A8)and NmFic(SE/AA). Diffraction data were collected 
at beamline X06SA of the Swiss Light Source. For NmFic(A8)and NmFic(SE/AA), 
we obtained phases by molecular replacement using the NmFic structure (PDB 
code 2G03) as a search model. The VbhA-VbhT(FIC) complex was solved by 
molecular replacement using a fragment of BepA (PDB code 2JK8). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Identification of VbhA homologues. VbhA of B. schoenbuchensis was queried 
against a database composed of translated open reading frames (>10 amino acids) 
identified in the 500-bp upstream region of all fic loci (PFAM release 24, 2,189 
proteins). Nine rounds of Psi-Blast were performed and hits with an E-value <1 
manually validated. 

E. coli toxicity tests and cell filamentation. E. coli AB472, a derivative of 
MG1655, was transformed with VbhT-expressing plasmids and always handled 
in LB containing 1% glucose. A single colony was picked, resuspended in 20 ll of 
LB and plated on LB plates containing 100 uM IPTG (induction) or 1% glucose 
(repression). Plates were incubated overnight at 37°C. Growth curves were 
acquired by measuring optical density of liquid cultures in LB containing 100 
uM IPTG that had been inoculated from overnight cultures of single colonies 
and were grown at 30 °C with continuous shaking. Cell filamentation was visua- 
lized by co-transformation of plasmid pC10E that constitutively expresses GFP 
and examined using fluorescence microscopy. 

Conjugation experiments for VbhT toxicity tests in Bartonella. Plasmids were 
introduced into Bartonella strains by conjugation from E. coli using three-parental 
mating. Bartonella strains were grown for 36-48h at 35°C with 5% CO, on 
Columbia base agar plates supplemented with 5% defibrinated sheep blood and 
100 pg ml streptomycin. E. coli 82150 that harbours helper plasmid pRK2013, 
and E. coli 82150 that contains the VbhT-expressing plasmid were grown over- 
night at 37°C in LB supplemented with 50pgml * kanamycin or 30 pg ml * 
chloramphenicol, respectively, and both media were also supplemented with dia- 
minopimelic acid (DAP) and 1% glucose. After 16h of incubation, E. coli strains 
were diluted 1:50 in fresh LB medium and grown to a opticial density (OD) at 
595 nm (ODs9snm) of 0.4-0.8. Subsequently, each E. coli strain was diluted to 
ODsosnm Of 0.25, washed once and resuspended in supplemented M199 
medium (containing 10% FCS and 1% glucose). Bartonella strains were collected 
in 1 ml and resuspended in 60 ttl of M199 (ODso5snm = 1). Each Bartonella sus- 
pension was mixed with 20 ul of E. coli 82150 that harbours pRK2013 and 20 ll of 
E. coli 82150 that harbours the VbhT-expressing plasmid. The conjugation mix 
was distributed on a conjugation filter on a columbia blood agar (CBA) plate 
supplemented with 150 ul DAP and 150 pl 1% glucose. After 6h of incubation 
under Bartonella growth conditions as described before, the bacteria were washed 
off the filter with 1 ml supplemented M199. Dilutions of bacterial suspensions 
were plated on lysogeny broth agar (LA) supplemented with DAP, 1% glucose and 
301g ml! chloramphenicol for selecting donors, on CBA supplemented with 
1.2 ugml | chloramphenicol for selecting transconjugants, and on CBA supple- 
mented with 100 1g ml’ streptomycin for selecting recipients. Agar plates were 
incubated under Bartonella growth conditions and colony-forming units were 
counted after 1 day for donors and after 7 days for recipients and transconjugants. 
In vitro adenylylation assay. Adenylylation activity of VbhT was assessed in an 
assay using lysates of ectopically expressing E. coli. Bacterial pellets were resus- 
pended in reaction buffer (50 mM Tris-HCl pH 8.0, 150 mM NaCl, 0.1 mM EGTA, 
15mM MgCh, 140 pg ml | RNase A and protease inhibitor cocktail (Roche)) and 
lysed by sonication. After clearing lysates by centrifugation, supernatants were 
used for experimentation or stored at —20 °C. 

Adenylylation reactions were prepared by supplementing 15 pl supernatant 

from expression cultures with 10 Ci [o-**PJATP (Hartmann Analytic) and 
25 ul blank E. coli supernatant. Adenylylation activity of NmFic and SoFic con- 
structs was assessed by incubating 60 jig purified protein with 10 Ci [«-*’P]ATP 
(Hartmann Analytic) and 25 yl blank E. coli supernatant. Reactions were incu- 
bated for 1h at 30°C, resolved by SDS-PAGE, and adenylylation was probed by 
autoradiography. For HYPE(FIC) assays, 15 ng of pure protein was incubated with 
1.6 1g of purified GTPases. 
Protein expression and purification. pFVS0011 vector (encoding VbhA and 
VbhT(FIC)) was transformed into E.coli BL21 (DE3). E. coli cultures were grown 
at 37°C in LB medium supplemented with 50pg ml”! of kanamycin to an 
ODs05nm Of 0.6 before induction with 0.3mM IPTG for 16h at 23°C. Vectors 
pFVS0015 (carrying the NmFic gene), pFVS0016 (encoding NmFic(A8)), 
pFVS0037 (encoding NmFic(SE/AA)) were transformed into BL21-AI cells. 
Cells were incubated overnight in 750 ml LB medium that was supplemented with 
50 ug ml ! kanamycin and 1% glucose at 23 °C at 200 r.p.m. until ODso5nm of 2 was 
reached. Bacterial pellets were resuspended in terrific broth medium containing 
50 pg ml ' kanamycin to obtain an ODs95nm of approximately 1. Protein expression 
was induced at 23 °C with 0.1% arabinose and 0.5 mM IPTG for 23h at 200 r.-p.m. 
Plasmids harbouring HYPE(FIC) and SoFic constructs were transformed in E. coli 
Rosetta (DE3) cells and BL21-AI cells, respectively. The proteins were expressed as 
described for NmFic. 

Cells containing overexpressed VbhA-VbhT(FIC) and NmFic were resus- 
pended in lysis buffer containing 20mM Tris (pH7.5), 250mM NaCl and 
25mM imidazole and disrupted using French press. Cell debris was pelleted by 
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ultracentrifugation and the supernatant was applied to a His-Trap column (GE 
Healthcare). The stable complex was eluted with a gradient of elution buffer 
containing 20mM Tris (pH7.5), 250mM NaCl and 500mM imidazole. The 
protein was then concentrated and injected on a Superdex 75 16/60 gel filtration 
column (GE Healthcare) equilibrated with 10mM Tris (pH7.5) and 100mM 
NaCl. The pure proteins were concentrated to 6mgml ' for VbhA-VbhT(FIC) 
and 53 mgml ! for NmFic. 

The same purification protocol was used for NmFic(A8) and NmFic (SE/AA), 
but with an additional intermediate purification step. After affinity purification, 
the proteins were adjusted to 20mM Tris (pH 8.5), 25mM NaCl, applied to a 
Resource-Q anion exchange column (Amersham Biosciences) and eluted with a 
linear gradient of NaCl. Peak fractions were concentrated and further purified by 
gel filtration chromatography. Purified proteins in 10 mM Tris (pH 7.8), 100 mM 
NaCl were concentrated to 30mgml~' for NmFic(A8) and 51mgml' for 
NmFic(SE/AA). SoFic and SoFic(E73G) were purified as described previously’’. 
HYPE(FIC), HYPE(FIC/H295A) and HYPE(FIC/E234G) were purified in the 
same way as NmFic. GST-tagged Cdc42 and Racl were expressed and purified 
as described previously'*””. 

Crystallization. All crystals were obtained at 20°C (except for NmFic crystals, 
which were obtained at 4°C) using the hanging-drop vapour diffusion method 
after mixing 1 jl protein solution with 1 pl reservoir solution. VobhA-VbhT(FIC) 
and NmFic(SE/AA) crystallized in 23% (w/v) PEG 3350 and 0.2 M di-ammonium 
tartrate, and were cryoprotected with 25% (w/v) PEG 3350, 0.2 M di-ammonium 
tartrate and 10% glycerol. NmFic crystallized in 5% 2-propanol, 0.1 M MES pH 6.0 
and 0.1 M Ca-acetate, and was transferred into 8% 2-propanol, 0.1 M MES pH 6.0, 
0.1M Mg-acetate and 15% glycerol, then into 30% glycerol for cryoprotection. 
NmFic(A8) crystallized with 44% (v/v) PEG 600, 0.1 M Na-citrate pH5.6. No 
cryoprotection was needed for data collection. In each case, the substrate analogue 
complex was produced by crystal soaking for 2h with 10 mM AMPPNP, 10 mM 
MgCl. 

Structure determination. Statistics of data collection and refinement are given in 
Supplementary Table 1. Diffraction data were collected at beamline X06SA (PXIII) 
of the Swiss Light Source (2, = 1.0 A) at 100 Kona MARCCD detector, processed 
using MOSFLM”, and scaled with SCALA”'. The structures were determined by 
molecular replacement (PHASER) using a BepA fragment (PDB code 2JK8"°, 
residues 30-194) or the uncomplexed NmFic structure (PDB code 2G03, unpub- 
lished, Midwest Center for Structural Genomics) as models for structure solution 
of VbhA-VbhT(FIC) and the different NmFic proteins, respectively. A structure 
solution of wild-type and mutant NmFic in complex with AMPPNP was straight- 
forward, whereas a weak solution (RFZ = 5.1, TFZ = 3.6) with poor phasing 
power was obtained for VbhA-VbhT(FIC). The partial model lacking VbhA 
was refined by rigid-body refinement using REFMACS (ref. 23) with three bodies 
to an Rfree of 52%. Model extension using the module AutoBuild of the PHENIX 
package** yielded an almost complete model (R/Rgeo = 30.8%/35.2%). The 
remainder of the molecule was traced manually with COOT” and then by full 
refinement using PHENIX™. The Ramachandran plot showed that more than 99% 
of the residues are in favoured regions of the four structures. The figures were 
generated with Dino (http://www.dino3d.org). 

Prediction of inhibition motif in Fic proteins. All Fic proteins (PFAM release 
24, 2,189 proteins) were subjected to a profile-to-profile comparison with 
sequences from the PDB using HHpred'’”. HHpred builds an alignment of homo- 
logues for each query sequence by using iterations of PSI-BLAST searches against 
the non-redundant database. Secondary structures are then predicted on the PSI- 
BLAST alignment using PSIPRED”*. On the basis of this data, a profile Hidden 
Markov Model (HMM) is generated. Each query profile is compared with the pre- 
computed HMMs of the proteins in the PDB to identify structural homologues. In 
terms of query profiles, the PDB profiles include secondary structure information 
derived from their three-dimensional structure. We analysed the pairwise profile 
alignments of each Fic protein with the eight different Fic family members deposited 
in the PDB. A Fic protein was predicted to belong to class II or class III if the 
templates’ inhibitory motifs were aligned to a corresponding query sequence in 
the profile alignments (see Supplementary Information, section 1). 

Phylogenetic analysis of Fic proteins. Phylogenetic trees of the Fic family were 
inferred with FastTree 2 (ref. 27) and RAxML 7.0.4 (ref. 28). Trees were built on 
the amino acid alignment provided by the PFAM database. Unaligned overhanging 
ends were trimmed off and identical sequences were reduced to one representative. 
We used local support values based on the Shimodaira—Hasegawa test to estimate 
the reliability of the tree inferred with FastTree2. The RaxML tree was inferred using 
the PROTMIXWAGEF model and 25 rate categories. 

Liquid chromatography-mass spectrometry analysis. 2.51M of purified 
NmFic, NmFic(A8) or NmFic(SE/AA) were incubated in reaction buffer 
(10mM Tris, pH8, 100mM NaC)) in the presence or absence of 50 4M ATP 
and 50M MgCl, for 1h. Proteins were reduced in 5mM TCEP, alkylated in 
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10 mM iodacetamid and digested with sequencing grade trypsin (Promega). The 
generated peptides were purified with C18 Microspin columns (Harvard 
Apparatus) and analysed using liquid chromatography-mass spectrometry 
(LC-MS) or MS on an easy nano-LC system coupled to an LTQ-Orbitrap- 
Velos mass spectrometer (both from Thermo-Fisher Scientific), as recently 
described” using a linear gradient from 95% solvent A (0.15% formic acid, 2% 
acetonitrile) and 5% solvent B (98% acetonitrile, 0.15% formic acid) to 35% solvent 
B over 40 min. The data acquisition mode was set to obtain one high-resolution 
MS scan in the Fourier Transform (FT) part of the mass spectrometer at a reso- 
lution of 60,000 (full width at half maximum) and MS-MS scans in the linear ion 
trap of the 20 most intense ions. The resulting MS2 scans were searched against a 
N. meningitidis protein database containing the target protein sequence, including 
NmFic and NmFic(SE/AA) sequences, that was obtained from EBI (http://www. 
ebi.ac.uk) using the SEQUEST search algorithm provided in the Proteome 
Discoverer software package (Thermo-Fisher Scientific). In silico trypsin digestion 
was performed after lysine and arginine (unless followed by proline), with a tol- 
erance of two missed cleavages in fully tryptic peptides. Database search parameters 
were set to allow phosphoadenosine modification (+329.05252 Da) of threonine 
and tyrosine residues as variable modification and carboxyamidomethylation 
(+57.021464 Da) of cysteine residues as fixed modification. The fragment mass 
tolerance was set to 0.8 Da and the precursor mass tolerance to 15 p.p.m. 

Strain construction. For toxicity experiments, the vbhT wild-type gene 
(FN645515) from B. schoenbuchensis R1 was cloned into pRSF-Duet1 
(pPE0017, Hisg-tagged vbhT). VbhT(H136A) (pPE0034) was constructed by 
introducing a two-base-pair point mutation in the FIC motif of vbhT of 
pPEO0017, as described elsewhere’. Plasmid co-expressing VbhT and VbhA 
(VbhT/VbhA) was constructed by cloning vbhA (FN645515) amplified from B. 
schoenbuchensis R1 into pRSF-Duet1 (pPE0020, HA-tagged vbhA). vbhT was then 
cloned into pPE0020, resulting in pPE0021. To construct VbhT-expressing plasmids 
for Bartonella, vbhT from B. schoenbuchensis R1 was cloned into vector pMMB206 
(ref. 31) (pVbhT, HA-tagged vbhT). pVbhT(H136A) was constructed from pVbhT 
as described before. 

The in-frame deletion of the complete vbhA/vbhT operon in B. schoenbuchensis 
(Bsch AvbhA/vbhT) was generated as described previously by a two-step gene 
replacement procedure’. The mutagenesis vector pPE3005 was constructed by 
ligating a cassette with the flanking regions of the in-frame deletion into pTR1000°. 

For protein purification, the full-length vbhA gene and part of the vbhT gene 
encoding the FIC domain (vbhT(FIC)), amino acid residues 1-198, His,-tagged) 
were amplified from plasmid pPE0021 and cloned into the pRSF-Duetl vector 
(pFVS0011). VbhA(E24G)/VbhT(FIC) expression plasmid (pFVS0065) was 
generated by introducing a two-base-pair mutation in pFVS0011. The NmFic gene 
was amplified with an N-terminal Hisg-tag from N. meningitidis from coding 
region of amino acid residues 11-191 and from coding region of amino acid 
residues 11-167 to generate plasmids expressing NmFic (pFVS0015) and 
NmFic(A8) (pFVS0016), respectively. The S182A/E186A double mutant 
construct (NmFic(SE/AA)) was generated by introducing two subsequent point 
mutations in pFVS0015. The E186G mutant construct (NmFic(E186G)) was 


generated by the same approach. The SoFic gene was amplified from plasmid 
(ASU biodesign institute, Clone ID SoCD00104192) and cloned with an 
N-terminal His¢-tag into pRSF-Duetl (pFVS0040). The SoFic(E73G) plasmid 
(pFVS0058) was generated by introducing a two-base-pair point mutation in 
pFVS0040. GST-HYPE(E234G) (pFVS0064) was generated by introducing point 
mutations in the plasmid containing GST-HYPE. From these plasmids, shorter 
constructs (HYPE(FIC), HYPE(FIC/E234G) and HYPE(FIC/H295A)) only 
carrying the FIC domain of HYPE (from amino acids 187 to 437) were generated. 
For the expression of human Cdc42 and Racl, the Cdc42-Q61L and Racl-Q61L 
coding sequences were amplified from plasmid pRKS5myc L61 Cdc42” and 
pRK5FLAG L61 Racl (ref. 32) and cloned into pGex6p1 with an N-terminal 
GST-tag, resulting in pAH088 and pAH060, respectively. The wild-type variants 
of Cdc42 and Racl were generated from the mutant constructs through polymerase 
chain reaction (PCR)-based site-directed mutagenesis” (resulting in pAH059 and 
pAHO071). All primers and the resulting vectors are summarized in Supplementary 
Tables 5 and 6. 
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Cysteinyl leukotriene type I receptor desensitization 
sustains Ca** -dependent gene expression 


Siaw-Wei Ng’, Daniel Bakowski!, Charmaine Nelson’, Ravi Mehta!, Robert Almeyda’, Grant Bates” & Anant B. Parekh! 


Receptor desensitization is a universal mechanism to turn off a 
biological response; in this process, the ability of a physiological 
trigger to activate a cell is lost despite the continued presence of the 
stimulus. Receptor desensitization of G-protein-coupled receptors 
involves uncoupling of the receptor from its G-protein or second- 
messenger pathway followed by receptor internalization’. G-protein- 
coupled cysteinyl leukotriene type I (CysLT1) receptors regulate 
immune-cell function and CysLT1 receptors are an established thera- 
peutic target for allergies, including asthma’. Desensitization of 
CysLT1 receptors arises predominantly from protein-kinase-C- 
dependent phosphorylation of three serine residues in the receptor 
carboxy terminus’. Physiological concentrations of the receptor 
agonist leukotriene C, (LTC,) evoke repetitive cytoplasmic Ca** 
oscillations, reflecting regenerative Ca’* release from stores, which 
is sustained by Ca** entry through store-operated calcium-release- 
activated calcium (CRAC) channels*. CRAC channels are tightly 
linked to expression of the transcription factor c-fos’, a regulator 
of numerous genes important to cell growth and development’. 
Here we show that abolishing leukotriene receptor desensitiza- 
tion suppresses agonist-driven gene expression in a rat cell line. 
Mechanistically, stimulation of non-desensitizing receptors evoked 
prolonged inositol-trisphosphate-mediated Ca** release, which led 
to accelerated Ca”* -dependent slow inactivation of CRAC channels 
and a subsequent loss of excitation-transcription coupling. Hence, 
rather than serving to turn off a biological response, reversible 
desensitization of a Ca** mobilizing receptor acts as an ‘on’ switch, 
sustaining long-term signalling in the immune system. 

The paradox of receptor desensitization is how long-term responses 
can be evoked if the receptor is inactivated. This is a particularly acute 
problem in immune cells, in which cell differentiation and clonal 
selection develop over hours in the continued presence of external 
cues. 

Stimulation of rat basophilic leukaemia (RBL-1) cells with LTC,, 
acting exclusively on CysLT1 receptors’* (Supplementary Fig. 1), led 
to cytoplasmic Ca”™ signals (Fig. 1a) followed by robust expression of 
c-fos at both messenger RNA (Fig. 1b, c)* and protein levels (Fig. 1d, e). 
Maximal activation of CRAC channels with thapsigargin led to a similar 
increase in c-fos expression (Fig. lb-e). Both LTC, and thapsigargin 
induce c-fos expression through the build-up of Ca** microdomains 
near open store-operated CRAC channels**. Thapsigargin led to a larger, 
more sustained Ca** signal than LTC, (Fig. 1a)*° and the rate of Ca?* 
entry through CRAC channels was approximately twofold more for 
thapsigargin than LTC, (Fig. 1f), consistent with patch-clamp record- 
ings’. The similar increase in Ca** -dependent c-fos expression to LTC, 
and thapsigargin was therefore surprising, given the significant differ- 
ence in CRAC channel activation. 

We considered various explanations for why CysLT1 receptor 
activation and thapsigargin evoked similar c-fos levels despite marked 
differences in the extent of CRAC channel activation. These included: 
(1) CysLT1 receptors tapped into a different signalling mechanism 
linking CRAC channel microdomains to c-fos expression; (2) local 
Ca’* entry through CRAC channels was larger after receptor activation 


because LTC, hyperpolarized the membrane potential; and (3) cyto- 
plasmic Ca’* and protein kinase C (PKC) interacted synergistically to 
drive gene expression in response to CysLT1 receptor activation. 
Evidence against these possibilities is presented in Supplementary 
Figs 2-4. Instead, gene expression showed high sensitivity to Ca”* entry, 
enabling CysLT1 receptor activation to couple effectively to c-fos tran- 
scription (Supplementary Fig. 5), as well as rapidity and high gain. 
Combined, this ensures efficient gene expression to bursts of CRAC 
channel activity after physiological levels of receptor stimulation. 

Experiments described in Fig. 1g and h revealed an important role 
for PKC in receptor-dependent gene expression. The structurally dis- 
tinct PKC blockers G06983 and calphostin C abolished c-fos expres- 
sion (Fig. 1g, h). PKC block had a marked effect on the Ca** signal 
evoked by agonist. Whereas cytoplasmic Ca** oscillations were 
routinely observed with LTC, (Fig. li), the response was converted 
into a large, single, slowly decaying Ca** spike after PKC inhibition 
(Fig. 1i, j). Acute stimulation with PMA in the absence of LTC, failed 
to induce significant c-fos expression (data not shown)’, demonstrat- 
ing that PKC activity per se was not sufficient to induce c-fos expres- 
sion in these cells. The PKC inhibitors had no effect on thapsigargin- 
evoked c-fos expression (Fig. 1k) or cytoplasmic Ca”* signals (Fig. 11). 
Thapsigargin (211M) activates CRAC channels maximally (Sup- 
plementary Fig. 5) and, by blocking SERCA pumps (which can be 
located near CRAC channels!°), reduces the decay of Ca’* gradients 
radiating from the plasma membrane. It is possible that other non- 
receptor-dependent stimuli, which raise local Ca*~ levels less effec- 
tively than 2M thapsigargin, might activate c-fos in a manner 
dependent on basal PKC activity, but this activity would be unusual 
in that it is not stimulated acutely by PMA in the presence of sub- 
maximal CRAC channel activation (Supplementary Fig. 4). 

In RBL cells, exposure to the phorbol ester PMA for several hours 
downregulates several PKC isozymes". Using this protocol, we found 
that c-fos expression was substantially reduced in response to CysLT1 
receptor stimulation (Fig. 1m), whereas no significant reduction was 
seen when thapsigargin was used instead (Fig. 1m) or when inactive 
40-phorbol replaced PMA (data not shown). Similar to PKC blockers, 
the Ca** signal to LTC, was prolonged after PKC downregulation 
(Fig. In). This prolonged Ca** signal did not reflect a change in 
Ca* clearance mechanisms (Supplementary Fig. 6); instead, it is char- 
acteristic of loss of receptor desensitization, particularly for CysLT1 
receptors, in which desensitization is mediated predominantly by 
PKC? and prevention of desensitization leads to broader Ca’* signals’. 
Inhibition of CysLT1 receptor desensitization is predicted to lead to 
greater inositol trisphosphate (InsP3;) production and hence more 
extensive Ca~* store emptying. Several findings are consistent with this. 
First, Ca”* release to LTC, lasted approximately five times longer 
when PKC was blocked than in control cells (Fig. 2a, expanded in 
inset). Second, the amount of Ca’* remaining within the stores, 
measured as the ionomycin-sensitive Ca’? response’, was sub- 
stantially less after activation of CysLT1 receptors in the presence of 
PKC block than in control cells (Fig. 2a). Third, InsP; production, 
measured using the green fluorescent protein—pleckstrin homology 
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Figure 1 | CysLT1-receptor-dependent c-fos expression requires PKC. 

a, Averaged Ca’ signals to LTC, and thapsigargin (Thap.) are compared (>50 
cells per graph). R (356/380), ratio of fluorescence measured at 356 and 380 nm. 
b, c-fos expression is compared between control (non-stimulated), 160 nM 
LTC,- and 2 tM thapsigargin-stimulated cells. Stimulus was present for 8 min. 
c, Histograms show averaged responses from three independent experiments. 
LTC, and thapsigargin groups were different from control (P < 0.001), but not 
from one another (P > 0.3; analysis of variance (ANOVA)). d, Cells stained 
with antibody against c-Fos protein. DAPI, 4’ ,6-diamidino-2-phenylindole. 

e, Aggregate data are compared (n > 20 per bar). Thapsigargin and LTC, 
groups were different from control (P< 0.001) but not from one another 

(P = 0.11). f£, Ca?* entry rate was measured after readmission of Ca?* to cells 
stimulated with LTC, or thapsigargin in Ca?* -free solution (*P < 0.01). 

g, G06983 (1 uM; 10 min pre-treatment) suppresses LTC,-induced c-fos 
expression. h, Histogram comparing the effects of PKC blockers. LTC, control 
group (LTC, in the absence of PKC block) was different from the other groups 


(GFP-PH) construct”, increased to a greater extent when PKC was 
inhibited (Fig. 2b). 

Cytoplasmic Ca”* inhibits CRAC channels through mechanisms of 
fast and slow inactivation'®. The prolonged Ca** release evoked by 
LTC, in the presence of non-desensitizing receptors could therefore 
inactivate CRAC channels to suppress agonist-evoked gene expres- 
sion. In support of this, accumulation of the slow Ca”* chelator 
EGTA in the cytoplasm rescued gene expression to CysLT1 receptor 
activation in the presence of PKC block (Fig. 2c, d). Ca”*-dependent 
fast inactivation of CRAC channels is unlikely to contribute here 
because (1) it is unaffected by the slow chelator EGTA'®”’”, which 
reversed the inhibitory effects of PKC block (Fig. 2c, d), and (2) the 
rate and extent of fast inactivation were unaltered by CysLT1 receptor 
activation in the presence of PKC downregulation (Fig. 3a). Instead, 
Ca’ * -dependent slow inactivation is likely to be the dominant mech- 
anism because (1) it too is suppressed by cytoplasmic EGTA™®”’, (2) the 
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(P< 0.01). There were no significant differences between the other groups. GO, 
G06983. Cal. C, calphostin C. i, Single-cell cx signals to LTC, are compared 
for the conditions shown. j, Averaged data are compared (>45 cells for each 
condition). k, Histogram showing c-fos expression to thapsigargin in the 
presence of PKC blockers. All thapsigargin-treated groups were significantly 
different from control (P < 0.001) but were not significantly different from one 
another. I, Ca” signals to thapsigargin are unaffected by PKC block. 0 Ca’*, 
Ca’ -free external solution; 2 Ca’~, external solution containing 2 mM Ca?*, 
m, Downregulation of PKC (PMA; 500 nM, 24h) reduces LTC,- but not 
thapsigargin-induced c-fos expression (data from four independent 
experiments). All stimulated groups were significantly different from control 
(P< 0.01). For LTC, the PMA group was different from the LTC, control 
(P< 0.01). For thapsigargin, the PMA groups was not different from the 
thapsigargin control (P = 0.07). Ctrl, control. n, PKC downregulation alters the 
LTC,-evoked Ca*" signal. Error bars show data + s.e.m. NS, not significant. 


Ca?* dependence of slow inactivation has a dissociation constant (Ka) 
of ~0.5 1M and full block occurs at ~1 LM (Fig. 3b), which is similar to 
the peak Ca** rise evoked by LTC, in the presence of PKC inhibitors or 
following downregulation of PKC (0.87 + 01M), and (3) Ca’*- 
dependent slow inactivation develops with a time course similar to 
the duration of the prolonged Ca** rise caused by LTC, following loss 
of PKC activity’®”. 

If prolonged Ca** release to non-desensitizing CysLT1 receptors 
leads to slow inactivation of CRAC channels, then development of a 
calcium-release-activated calcium current (crac) to a subsequent 
stimulus should be impaired. Pre-activation of CysLT1 receptors 
reduced Icrac evoked by thapsigagin but only in the presence of 
PKC block (Fig. 3c). No such inhibitory effect was seen when cells 
were dialysed with a strongly buffered Ca**-containing pipette solu- 
tion, which prevents the development of slow inactivation (Fig. 3d). 
Increasing the time between the termination of Ca~* release and 
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Figure 2 | Gene expression to non-desensitizing CysLT1 receptors is 
rescued by preventing a cytoplasmic Ca’* rise. a, Stimulation with LTC, in 
the presence of G06983 evokes a more sustained Ca’ release response, and this 
leads to more extensive store depletion (measured through the extent of Ca”* 
release evoked by 5 1M ionomycin). Both LTC, and ionomycin were applied in 
Ca?*-free external solution. Inset compares the kinetics of Ca*" release. 

b, Cytosolic GFP-PH levels, a measure of InsP; levels, rise when CysLT1 
receptors are stimulated in the presence of G06983. c, Top, loading cells with 
the Ca** chelator EGTA prevents loss of gene expression to agonist when PKC 
is blocked. Bottom, aggregate data from five independent gels are summarized. 
d, As in c, but calphostin C was used to block PKC instead. Bottom, aggregate 
data from three independent gels are summarized. Error bars show data + 
s.e.m. NS, not significant. 


subsequent store-operated Ca** entry should enable some recovery 
from Ca** -dependent slow inactivation and this should partially rescue 
gene expression. When Ca~~ influx was evoked a few minutes after 
Ca** release, significant, albeit incomplete, rescue of Ca" entry 
(Fig. 3e) and c-fos transcription (Fig. 3f) occurred in cells stimulated 
with LTC, in the presence of PKC inhibition. Hence, allowing CRAC 
channels time to recover from Ca”* -dependent inactivation results in 
partial rescue of agonist-driven gene expression. 

Our attempts to express the PKC-insensitive CysLT1 receptor, in 
which $313, S315 and S316 had been mutated to alanines, were 
thwarted by the difficulty of expressing these receptors’, although in 
a few cells we observed that Ca** oscillations to LTC, were less fre- 
quent (3.1 + 0.5 versus 5.4 + 0.4, 4 and 6 cells, respectively) and the 
initial spike was a little broader (~1.25-fold) than mock-transfected 
cells. 

To place our findings in a physiological context, we turned to the 
human nasal polyp, which is rich in mast cells’. The polyp and asso- 
ciated nasal mucosa are largely self-contained, providing an excellent 
‘quasi in vivo human system’. Mast cells from polyps, acutely isolated 
from patients undergoing surgery, respond to LTC, and express func- 
tional CRAC channels”®. Stimulation with LTC, activated c-Fos protein 
expression in mast cells isolated from polyps (Fig. 3g) and this was 
reduced by pre-treatment with either calphostin C or G06983 
(Fig. 3h). PKC inhibitors had no inhibitory effect when thapsigargin 
was used instead. 

Western blots revealed the presence of Ca**-dependent PKC-«, -B 
and -C isozymes’ but only faint expression of PKC-6 and -¢ (Fig. 4a). 
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Figure 3 | Ca”*-dependent slow inactivation underlies suppression of c-Fos 
expression to non-desensitizing CysLT1 receptors. a, Ca”‘-dependent fast 
inactivation is unaffected by non-desensitizing receptors (labelled — PKC). 
Cells were stimulated with LTC, (160 nM) before breaking in with a pipette 
solution containing thapsigargin and buffered Ca** (140 nM) and fast 
inactivation was measured within 60s of break-in. b, Dependence of Ca 
dependent slow inactivation on patch pipette Ca”* concentration. pA, 
picoamperes; pF, picofarads. [Ca** ] pipettes concentration of Ca?* in pipette. 
c, Stimulation of non-desensitizing receptors with LTC, before break-in 
significantly reduced the size of Icrac that developed in response to dialysis 
with thapsigargin in weak buffer (0.2 mM EGTA). d, As in ¢, but cells were 
dialysed with a pipette solution containing strong Ca~* buffer (10 mM EGTA, 
140 nM free Ca**). e, Store-operated Ca”* entry recovers partially by increasing 
the time interval between Ca” * release and subsequent Ca’* entry. f, c-fos 
expression to non-desensitizing receptor stimulation is rescued partially when 
Ca’* entry occurs several minutes after Ca** release has reached completion. 
g, c-Fos expression in human nasal mast cells after CysLT1 receptor activation 
is suppressed by PKC inhibition. h, Aggregate data are compared (12-17 cells 
per bar; three patients each). Error bars show data + s.e.m. NS, not significant. 


Overnight PMA exposure significantly reduced PKC-« and -f 
expression, but not PKC-C (Fig. 4a, b). Their weak expression made 
PKC-6 and -¢ difficult to quantify. Confocal microscope studies con- 
firmed robust expression of PKC-«, -f and -¢ (Fig. 4c), with barely 
detectable levels of PKC-6 and -¢ (data not shown). Overnight PMA 
exposure significantly reduced PKC-o and -f but not -¢ at the cellular 
level (Fig. 4d). Knockdown of PKC-« using a targeted short interfering 
RNA (siRNA) approach (Fig. 4e) resulted in a broadening of the first 
Ca’* oscillation evoked by LTC,, indicative of less receptor desens- 
itization, and fewer Ca*™ oscillations in each cell (Supplementary Fig. 
7). Knockdown of PKC-B had a much weaker effect on the Ca** 
oscillations (Supplementary Fig. 7). Knockdown of PKC-a or PKC-« 
plus -B simultaneously, but not PKC-f alone, reduced LTC,-driven 
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Figure 4 | PKC-a regulates CysLT1 receptor-driven c-fos transcription. 

a, Expression of PKC-a, -f and -¢ (western blot) is shown in control cells and 
cells exposed to PMA for 24h. b, Quantification of data from three independent 
experiments, as in a. c, Confocal microscope images of PKC expression for the 
conditions shown. Cells were fixed before analysis. Original magnification, 
100. d, Quantification of images from experiments, as in c. e, siRNA against 
PKC-« or -B significantly reduces corresponding protein expression. Left, 
DAPI staining of nuclei; middle, GFP expression (indicating transfection); and 
right, PKC-« expression after siRNA-mediated knockdown. Right, aggregate 
data from four experiments are depicted. Both siRNA groups were different 
from control (P< 0.005). Error bars show data + s.e.m. NS, not significant. 
f, Knockdown of PKC-z, -B and PKC-a plus -8 on LTC,-dependent c-fos 
expression. Data are compared with mock-transfected cells. For comparison, 
24h exposure to PMA is included. All treated groups were significantly 
different from the LTC, control (black bar) group except siRNA PKC-B 
knockdown (P > 0.1). PKC-« plus -B and 24h PMA groups had P < 0.01; 
PKC-o group had P < 0.05. Error bars show data + s.e.m. NS, not significant. 


c-fos expression to an extent similar to that seen after overnight PMA 
treatment (Fig. 4f). Stimulation of non-G-protein-coupled FCeRI 
antigenic receptors in RBL-2H3 cells activates c-fos expression 
primarily through PKC-6 and -e”. Although it is possible that these 
PKC isoforms also contribute to gene expression under our conditions, 
our results nevertheless suggest a major role for PKC-« in G-protein- 
coupled-receptor desensitization, and thus coupling to the nucleus. 
Collectively, our findings reveal a counterintuitive function for 
desensitization of a phospholipase-C-coupled receptor. Rather than 
terminating a response, homologous receptor desensitization is essen- 
tial for maintaining excitation-transcription coupling. Desensitization 
of CysLT1 receptors is mediated principally by PKC-dependent phos- 
phorylation’. Prevention of receptor desensitization through either 
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acute block or degradation of PKC or after knockdown of PKC-« all 
led to loss of Ca**-dependent gene expression, despite potentiation of 
Ca’* release to agonist. Mechanistically, the prolonged Ca’* release 
phase accelerated Ca”* -dependent slow inactivation of CRAC channels, 
resulting in loss of Ca** entry. Because Ca** microdomains near open 
CRAC channels drive c-fos expression, the decline in CRAC channel 
activity abolishes excitation-transcription coupling. The interval 
between Ca” oscillations after CysLT1 receptor activation is ~25s 
(ref. 4). Because InsP3 has a short half-life in the cytoplasm (~1s)”’, 
receptor desensitization will presumably lower InsP3 levels during the 
interspike interval. Store refilling will occur quickly and CRAC channel 
activity will be transient after CysLT1 receptor stimulation. The short 
duration of Ca”* release and thus Ca’* entry, determined by receptor 
desensitization, will ensure Ca**-dependent slow inactivation does not 
develop, as this inhibitory mechanism requires a sustained Ca’ * rise for 
several seconds. It is therefore the kinetics of receptor desensitization 
and recovery from desensitization within a highly Ca** sensitive and 
high-gain system that ensures bursts of store-operated Ca** entry occur 
that are sufficient for the activation of c-fos expression, without the 
build-up of the Ca** -dependent slow inactivation pathway that would 
abolish the response. 


METHODS SUMMARY 


Cultured RBL-1 cells and acutely isolated human nasal polyps were prepared as 
described previously’. Transfection with siRNA constructs was achieved using the 
AMAXA system. Patch-clamp recordings and cytoplasmic Ca”* measurements 
followed standard methods’. RT-PCR, western blotting and confocal microscopy 
were carried out as described previously’. Error bars are s.e.m. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell culture and transfection. RBL-1 cells were bought from ATCC and were 
cultured (37 °C, 5% CO.) in Dulbecco’s modified Eagle medium with 10% fetal 
bovine serum, 2mM 1-glutamine and penicillin-streptomycin, as previously 
described**. Cells were transfected with siRNA against Syk (purchased from 
Invitrogen; 5'-3’ sense, CCCUCUGGCAGCUAGUGGAACAUUA; antisense, 
UAAUGUUCCACUAGCUGCCAGAGGG) using the Amaxa nucleofection 
system’. siRNA against PKC isozymes were purchased from Invitrogen; PKC-« 
sense, GCGACACCUGCGACAUGAAUGUUCA ; antisense, UGAACAUUCAU. 
GUCGCAGGUGUCGC; PKC-f sense, GGCUAAUGACCAAACACCCAGGC 
AA; antisense, UUGCCUGGGUGUUUGGUCAUUAGCC. Cells were used 36- 
48 h after plating. 

Human tissue. Human polyps were removed from patients with full ethical con- 
sent and with approval from the National Research Ethics Service (REC number 
07/H0607/104). Polyps were treated as described’ and used within 6h of surgical 
removal. 

ICRAC recordings. Patch-clamp experiments were conducted in the tight-seal 
whole-cell configuration at room temperature (20-24°C) as previously 
described’*. Sylgard-coated, fire-polished pipettes had resistances of 4.2- 
5.5 MQ when filled with standard internal solution that contained (in mM): 
Cs* glutamate 145, NaCl 8, MgCl, 1, Mg-ATP 2, EGTA 10, CaCl, 4.6 mM (free 
Ca?* ~140 nM), HEPES 10, pH 7.2 with CsOH. In some experiments, weak Ca?* 
buffer was used instead (0.2mM EGTA and no added CaCl,). A correction of 
+10mV was applied for the subsequent liquid junction potential that arose from 
this glutamate-based internal solution. The composition of the extracellular solu- 
tion was (in mM): NaCl 145, KC] 2.8, CaCl, 10, MgCl, 2, CsCl 10, D-glucose 10, 
HEPES 10, pH 7.4 with NaOH. 

Icrac was measured by applying voltage ramps (— 100 to + 100 mV in 50 ms) at 

0.5 Hz from a holding potential of 0 mV. For fast inactivation, step pulses (250 ms 
duration) were applied from 0 mV to —100 mV every 2s. Currents were filtered 
using an 8-pole Bessel filter at 2.5kHz and digitized at 100 ts. Currents were 
normalized by dividing the amplitudes (measured from the voltage ramps at 
—80 mV) by the cell capacitance. Capacitative currents were compensated before 
each ramp by using the automatic compensation of the EPC-9/2 (HEKA 
Elektronik) patch-clamp amplifier. For Icrac, leak currents were subtracted by 
averaging 2-3 ramp currents obtained just before Icrac had started to develop, 
and then subtracting this from all subsequent currents. 
Ca’* imaging. Ca”* imaging experiments were carried out at room temperature 
(21-24 °C) using the IMAGO CCD camera-based system from TILL Photonics, as 
described previously*’. Cells were alternately excited at 356 and 380 nm (20 ms 
exposures) and images were acquired every 2 s. Images were analysed offline using 
IGOR Pro for Windows. Cells were loaded with Fura 2-AM (2 1M) for 40 min at 
room temperature in the dark and then washed three times in standard external 
solution of composition (in mM) NaCl 145, KC] 2.8, CaCl, 2, MgCl, 2, D-glucose 
10, HEPES 10, pH 7.4 with NaOH. Cells were left for 15 min to allow further de- 
esterification. Ca”* -free solution had the following composition (in mM) NaCl 
145, KCl 2.8, MgCl, 2, D-glucose 10, HEPES 10, EGTA 0.1, pH 7.4 with NaOH). 
EGTA-AM loading. Cells were loaded with EGTA by incubation for 45 min with 
EGTA-AM as described?. 


Confocal microscopy. Cells were fixed in 4% paraformaldehyde in phosphate 
buffer for 30 min at room temperature. All the washes used 0.01% PBS (137 mM 
NaCl, 2.7 mM KCl, 8 mM Naz:HPOg, 1 mM KH2PO,). The cells were blocked with 
2% BSA and 10% goat serum for 1h. Anti-c-Fos, -PKC-6, -PKC-¢ and -PKC-¢ 
were used in carrier (0.2% BSA, 1% goat serum) and left overnight at 4 °C and were 
purchased from Cell Signalling. Anti-PKC-o and -B were used in carrier (0.2% 
BSA, 1% goat serum) and left overnight at 4°C and were purchased from Santa 
Cruz. The secondary anti-rabbit IgG was a HandL chain-specific (goat) fluorescein 
conjugate (Alexa Fluor 568, excitation at 578 nm, emission at 603 nm wavelength) 
from Invitrogen. This was used in PBS for 2 h at room temperature. The cells were 
mounted in Vectashield mounting medium. Images were obtained using a Leica 
confocal microscope, as described’. 

Western blotting. Total cell lysates (50 lug) were separated by SDS-PAGE on a 
10% gel and electrophoretically transferred to nitrocellulose membrane, as 
described’. Membranes were blocked with 5% non-fat dry milk in TBS plus 
0.1% Tween 20 (TBST) buffer for 1h at room temperature. Membranes were 
washed with TBST three times and then incubated with primary antibody over- 
night at 4 °C. Anti-PKC-6, -¢ and -C antibodies were obtained from New England 
Biolabs and used at 1:2,500 dilution. Anti-PKC-«, -B and total ERK2 antibodies 
were purchased from Santa Cruz Biotechnology. PKC-« and -f antibodies were 
used at 1:2,500 dilution and total ERK antibody was used at a dilution of 1:5,000. 
The membranes were then washed with TBST again and incubated with a 1:2,500 
dilution of goat anti-rabbit secondary antibody IgG from Santa Cruz 
Biotechnology for 1h at room temperature. After washing with TBST, the bands 
were developed for visualization using ECL-plus western blotting detection system 
(GE Healthcare). Gels were quantified using the UN-SCAN-IT software package 
(Silk Scientific). Total ERK2 is widely used as a control for gel loading. The 
antibody does not discriminate between phosphorylated (and hence active) and 
non-phosphorylated ERK2 and therefore detects the total amount of this protein, 
regardless of whether the kinase has been activated. The extent of PKC was 
therefore normalized to the total amount of ERK2 present in each lysate, to correct 
for any differences in amount of cells used for each experiment. 

RT-PCR. Total RNA was extracted from RBL cells by using an RNeasy Mini Kit 
(Qiagen), as described*’. RNA was quantified spectrophotometrically by absor- 
bance at 260 nm. Total RNA (1 1g) was reverse-transcribed using the iScript!M 
cDNA Synthesis Kit (Bio-Rad), according to the manufacturer’s instructions. 
Following cDNA synthesis, PCR amplification was then performed using BIO- 
X-ACTTM. Short DNA Polymerase (Bioline) with primers specific for the detection 
of c-fos were synthesized by Invitrogen. The PCR products were electrophoresed 
through an agarose gel and visualized by ethidium bromide staining. 

Statistics. Results are presented as means + s.e.m. Statistical significance was 
assessed using Student’s t-test for comparison between two groups or ANOVA 
followed by a post-hoc Newman Keuls multiple comparison test for the difference 
between groups and results were considered significant at *P < 0.05, **P<0.01. 
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Structures of cytochrome P450 17A1 with prostate 
cancer drugs abiraterone and TOK-001 


Natasha M. DeVore! & Emily E. Scott! 


Cytochrome P450 17A1 (also known as CYP17A1 and cytochrome 
P450c17) catalyses the biosynthesis of androgens in humans’. As 
prostate cancer cells proliferate in response to androgen steroids”’, 
CYP17A1 inhibition is a new strategy to prevent androgen synthesis 
and treat lethal metastatic castration-resistant prostate cancer’, but 
drug development has been hampered by lack of information 
regarding the structure of CYP17A1. Here we report X-ray crystal 
structures of CYP17A1, which were obtained in the presence of 
either abiraterone, a first-in-class steroidal inhibitor recently 
approved by the US Food and Drug Administration for late- 
stage prostate cancer’, or TOK-001, an inhibitor that is currently 
undergoing clinical trials**. Both of these inhibitors bind the haem 
iron, forming a 60° angle above the haem plane and packing against 
the central I helix with the 3B-OH interacting with aspargine 202 in 
the F helix. Notably, this binding mode differs substantially from 
those that are predicted by homology models and from steroids in 
other cytochrome P450 enzymes with known structures, and some 
features of this binding mode are more similar to steroid receptors. 
Whereas the overall structure of CYP17A1 provides a rationale for 
understanding many mutations that are found in patients with 
steroidogenic diseases, the active site reveals multiple steric and 
hydrogen bonding features that will facilitate a better understand- 
ing of the enzyme’s dual hydroxylase and lyase catalytic capabilities 
and assist in rational drug design. Specifically, structure-based 
design is expected to aid development of inhibitors that bind only 
CYP17A1 and solely inhibit its androgen-generating lyase activity 
to improve treatment of prostate and other hormone-responsive 
cancers. 

CYP17A1 is a membrane-bound dual-function monooxygenase 
with a critical role in the synthesis of many human steroid hormones’. 
The 17a-hydroxylase activity of CYP17A1 is required for the genera- 
tion of glucocorticoids such as cortisol, but both the hydroxylase and 
17,20-lyase activities of CYP17A1 are required for the production of 
androgenic and oestrogenic sex steroids (Supplementary Fig. 1). 
CYP17A1 is an important target for the treatment of breast and prostate 
cancers that proliferate in response to oestrogens and androgens””*. In 
the absence of structural information, CYP17A1 inhibitors have been 
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Figure 1 | Function of CYP17A1 and inhibition by clinical compounds. 
a, CYP17A1 titration with abiraterone (10-274 nM) yields progressive shifts in 
the ultraviolet-visible difference spectrum that indicate nitrogen binding to 
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designed that are thought to bind the cytochrome P450 haem iron’, but 
it has been difficult to rationalize or predict other structural features 
that are critical for effective and selective CYP17A1 inhibition. In 
addition, structural information is important for understanding 
17-hydroxylase deficiencies and may also improve our understanding 
of polycystic ovary disease’. We determined the structures of human 
CYP17A1 when bound to two clinically relevant CYP17A1 inhibitors 
(Supplementary Fig. 2). Abiraterone is the active form of a prodrug that 
was recently approved by the US Food and Drug Administration for 
metastatic prostate cancer** and is under investigation for breast cancer”. 
TOK-001 is currently undergoing clinical trials for prostate cancer’. 

A truncated, His-tagged version of the human CYP17A1 protein was 
generated from a synthetic complementary DNA that was engineered 
to remove the single amino-terminal transmembrane helix, and this 
CYP17A1 was expressed in Escherichia coli. The resulting CYP17A1 
was membrane bound and so it was solubilized with detergent before 
purification. This CYP17A1 binds abiraterone (17-(3-pyridyl)androsta- 
5,16-dien-3B-ol) (Fig. la) and TOK-001 (17-(1H-Benzimidazol-1- 
yl)androsta-5,16-dien-3f}-ol) with decreases in absorbance at 402 nm 
and increases in absorbance at 424 nm, consistent with nitrogen binding 
to the haem iron (type II interaction) with dissociation constant (Kq) 
values of <100 nM (Fig. 1a, inset). Similar titrations with the substrates 
progesterone (Fig. 1b) and pregnenolone revealed decreases in 
absorbance at 419 nm and increases in absorbance at 385 nm, which is 
indicative of the ligand displacing water from the haem (type I inter- 
action). CYP17A1 binds pregnenolone (K4 <100 nM) more tightly than 
progesterone (Ky 229 + 14 nM; Fig. 1b, inset). We found that full-length 
enzyme” had a similar k,,, and three-fold higher Michaelis constant 
(keat 1.31 + 0.03 min *, Ky, 11.4 + 0.7 uM) compared to the truncated 
form (keqt 1.31 £0.03 min™', Ky, 3.70.3 UM). The half-maximum 
inhibitory concentration (ICs9) values for abiraterone (201 + 1 nM) 
were lower than for TOK-001 (503 + 1 nM) (Fig. 1c). Thus, truncated 
human CYP17A1 is a functional enzyme in terms of ligand binding, 
catalytic function and inhibition. 

Both structures with abiraterone (2.6 A) and TOK-001 (2.4 A) 
demonstrate the characteristic cytochrome P450 fold (Fig. 2a) 
and have four very similar protein copies in each asymmetric unit 
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haem iron. b, Similar titration with progesterone (10-1535 nM) indicates water 
displacement from the haem iron. ¢, ICso of abiraterone (circles) and TOK-001 
(squares) for progesterone 17«-hydroxylation. 
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(Supplementary Table 1). Consistent with spectral binding data, 
abiraterone and TOK-001 bind with the nitrogen of the C17 pyridine 
or benzimidazole, respectively, forming a coordinate covalent bond 
with the haem iron (Fig. 2b, d). The steroidal cores of these inhibitors 
rise at an angle of 60° above the haem plane, directed between the F and 
G helices (Fig. 2b, d), and essentially overlap with each other (Fig. 2f). 
The unsubstituted o-face packs flat against the I helix where G301, 
A302 and adjacent residues form a highly complementary hydrophobic 
planar surface (Fig. 2b). The 3B-OH groups of abiraterone (Fig. 2b) and 
TOK-001 (Fig. 2d) hydrogen bond with N202 in the F helix (~2.6A 
and ~2.4 A, respectively). 

Although inhibitors occupy the majority of the enclosed active site, 
the void extends beyond these ligands in several directions. First, the 
active site wall nearest the inhibitor B-face is not as complementary to 
the steroid core as to the «-face. The C18 and C19 methyl groups 
project towards a crevice between the B’ helix, the 84 loop and the 
loop following the F helix (Fig. 2b). Only three side chains of the cavity 
wall come within 4 A of C18 or C19. The cavity wall facing the B-face of 
abiraterone or TOK-001 is primarily lined with hydrophobic atoms of 
A105, S106, A113, F114, 1206, L209, V236 and V482 (Fig. 2c), but there 
are two notable exceptions. R239 and D298 extend from the G and I 
helices, respectively, to orient their basic and acidic termini towards 
C6. These two polar side chains flank a substantial extension of the 
active site void adjacent to C6. Second, in the abiraterone structure 
there is unfilled space in the active site cavity adjacent to the pyridine 


N terminus 


Figure 2 | CYP17A1 ligand binding. In the stick and sphere representations, 
non-carbon atoms are indicated in blue (N), red (O), black (haem), grey 
(abiraterone), cyan (TOK-001) and by a dark red sphere (Fe). a, A coloured 
representation of the CYP17A1-abiraterone structure, from the N terminus 
(blue) to the C terminus (red). b, Abiraterone binds (2|F,|-|F.| density at 10; 
blue mesh) at an angle of approximately 60° from haem against helix I (yellow). 
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ring bordered by V366, A367, 1371 and V483 (Fig. 2c), and this volume 
is occupied by benzimidazole in the TOK-001 structure (Fig. 2e). Last, 
the most substantial active site cavity extension is from the 3B-OH of 
the inhibitors over the top of helix I and along the underside of helices 
F and G. This cavity is mostly lined by hydrophobic residues (1198, 
L243 and F300), but its ‘roof is bordered by several polar F and G helix 
residues (Y201, N202 and R239; Fig. 2c) that interact with, or are 
located near, water molecules in this region. The overall cavity contain- 
ing TOK-001 is similar to that observed when abiraterone is bound, 
except that the available void volume over helix I is slightly smaller 
(Fig. 2e). 

The single direct hydrogen bond between inhibitors and the protein is 
part of a larger hydrogen bonding network. In the abiraterone complex 
this network involves N202, E305, several conserved water molecules, 
R239, the backbone carbonyl of G297 and, in some molecules, Y201 
(Fig. 3). Although Y201 is not within hydrogen bonding distance to 
these waters for molecules A and B, the side chain rotates slightly 
towards abiraterone in molecules C and D to interact with one or both 
of the water molecules. TOK-001 has a very similar hydrogen bonding 
network (Supplementary Fig. 3). These interactions are strongly 
reminiscent of the interactions that are conserved in the androgen, 
oestrogen, glucocorticoid, mineralocorticoid and progesterone recep- 
tors'! (Fig. 4a, b). In each receptor, the 3B-OH or 3-keto group of 
steroids binds within a deep receptor pocket and forms hydrogen bonds 
with a glutamine or glutamate, an arginine and often a conserved water 


c, Abiraterone cavity (grey mesh), rotated approximately 180° in relation to 
b. d, TOK-001 binding (2| F,|-|F.| density at 1o, blue mesh). e, TOK-001 
cavity (grey mesh). f, Abiraterone structure (yellow) superimposed on TOK- 
001 structure (purple) with respective voids (mesh). The B’ helix has been 
removed from panels c-f so that it is possible to view the ligands. 
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Figure 3 | Hydrogen bond network with abiraterone. CYP17A1 has a 
hydrogen bonding network at the top of the active site that interacts with 
abiraterone. Molecules A and B (yellow), and C and D (green) have slightly 
different networks, with the main difference being the involvement of Y201. 
Water molecules are indicated by small spheres. Hydrogen bonds are 
represented by dashed lines and the distances between them are indicated. 


molecule. These interactions are critical for ligand recognition by 
hormone receptors” and may also contribute to CYP17A1 selectivity 
for pregnenolone, progesterone and their 17x-hydroxy derivatives. 
Notably, TOK-001 is both a CYP17A1 inhibitor and androgen receptor 


2.9 A I~ 


3.3Ad3.4A ( 


K-L loop 


Figure 4 | CYP17A1 compared to the androgen receptor and CYP11A1. 

a, The hydrogen bonding network near the abiraterone 3B-OH involves N202, 
R239 and conserved waters. b, The androgen receptor (PDB 3L3X) has a similar 
hydrogen bond network with R752, Q711 and several waters that interact with the 
dihydrotestosterone ketone. c, The structure of CYP17A1 with abiraterone (yellow) 
superimposed on the structure of CYP11A1 with 20,22-dihydroxycholesterol 
(PDB 3NA0, pink). This shows that there are markedly different steroid 
orientations between the two structures. 
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antagonist’’, and the similarity of these binding modes is probably the 
reason for this dual mechanism of action. 

Orientations of the native CYP17A1 substrates are of substantial 
value in understanding the function of this enzyme. Pregnenolone and 
progesterone were docked into the CYP17A1-abiraterone structure 
modelled as the Fe(1v) =O (compound I) catalytic state. Progesterone 
maintained the N202 hydrogen bond. The distances from C17 and 
C16 to the catalytic oxygen were 3.7A and 3.9A, respectively, 
distances that are consistent with the experimentally observed 
major 17a%-OH and minor 16a%-OH progesterone metabolites. The 
pregnenolone 3B-OH hydrogen was also bonded to N202 and the 
C17 atom was 3.6 A from the Compound I oxygen (Supplementary 
Fig. 4). However, the active site topology may be altered in the presence 
of substrates and this is an important area for further investigation. 

CYP17A1 can be compared to three other P450 enzymes that are 
involved in steroidogenesis or cholesterol metabolism with reported 
steroid complex structures: CYP19A1 (aromatase)'*, CYP11A1 (cho- 
lesterol side-chain cleavage enzyme)'* and CYP46A1, a cholesterol 24- 
hydroxylase’’. Although all four enzymes maintain the canonical 
cytochrome P450 fold, CYP19A1, CYP11A1 and CYP46A1 orient 
steroids in the opposite direction from CYP17A1 so that the steroid 
ligands are positioned over the K-L loop directed towards the B1 sheet 
(as shown for CYP11A1 in Fig. 4c) instead of oriented towards helices 
F and G. 

More than 50 CYP17A1 mutations have been identified, and most 
of these were identified in patients with 17-hydroxylase deficiencies. 
The biochemical effects of many clinical mutations can be rationalized 
by examining the CYP17A1 structure (Supplementary Fig. 5 and 
Supplementary Table 2). Mutations R96W, R125Q, H373D/N and 
R440H/C all alter residues that directly interact with the haem 
propionates and probably disrupt haem binding, resulting in the com- 
plete loss of activity that was observed’’”’. Mutations E305G, R347H/ 
C, R358Q and R449A eliminate only the lyase activity of CYP17A1***°. 
E305 forms a hydrogen bond to N202 in the active site, indicating a role 
in substrate positioning, whereas the other residues are on the proximal 
face of the protein (Supplementary Fig. 5), and this is consistent with 
their proposed role in cytochrome b; binding”’”®, which promotes lyase 
activity. Finally, reduction of the minor 16a-hydroxyprogesterone 
metabolite is reported for the artificial mutation A105L”’, consistent 
with its location in the active site facing the B-face where the additional 
bulk may reduce the steroid movement within the active site. 

Abiraterone and TOK-001 have several features that make them 
effective inhibitors of CYP17A1: first, a heterocyclic nitrogen that coor- 
dinates to the haem iron; second, a planar «-face to pack against the I 
helix; and third, hydrogen bonding interactions of 3B-OH with con- 
served polar residues in a hydrogen binding network. These structures 
provide a model for the binding of substrates and other inhibitors that is 
very different from binding orientations that have previously been pro- 
posed by homology modelling and docking studies and from structures 
that have been demonstrated in other steroid-metabolizing cytochrome 
P450 enzymes. Perhaps most importantly, the cavity that is observed is 
not bilobed as predicted by many modelling studies*”’. CYP17A1 
interactions with these inhibitors are instead more reminiscent of 
steroid binding to steroid receptors, and this may be the origin of 
the TOK-001 dual mechanism of action. Thus, these structures con- 
tribute to a better understanding of the function and inhibition of 
CYP17A1 ina way that should substantially benefit the apprehension 
of enzyme dysfunction in clinical disease and enable structure-based 
drug design of CYP17A1 inhibitors for treating hormone-responsive 
cancers, particularly prostate cancer. 


METHODS SUMMARY 


A synthetic cDNA for human CYP17A1 was modified to delete residues 2-19, 
substitute the hydrophilic sequence Arg 20-Arg 21-Cys 22-Pro 23 with Ala 20-Lys 
21-Lys 22-Thr 23 and add a carboxy-terminal four-histidine tag (Supplementary 
Fig. 6) before cloning into the pCWori* plasmid and overexpression in Escherichia 
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coli JM109 cells. Protein was purified by nickel affinity, cation exchange and size 
exclusion chromatography. Abiraterone was synthesized (see Methods). Binding 
affinities were determined using an ultraviolet—visible spectral shift assay. 
Progesterone 170-hydroxylation was evaluated using HPLC separation and ultra- 
violet detection. For crystallography, inhibitors were included throughout puri- 
fication. Crystals were grown from CYP17A1 (30 mg ml‘) complexed with 
inhibitor using hanging-drop vapour diffusion to equilibrate against 30% PEG 
3350, 0.175M Tris, pH8.5, 0.30M ammonium sulphate and 3% glycerol. 
Diffraction data were collected and phased by molecular replacement. Iterative 
model building and refinement generated the final model. Substrates were docked 
using Surflex-Dock. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Synthesis and characterization of abiraterone. A stirred solution of 17- 
iodoandrosta-5,16-dien-3f-ol (600 mg, 1.5 mmol) in tetrahydrofuran (20 ml) in 
a 100-ml round-bottomed flask was purged with argon. Bis(triphenylphosphine) 
palladium (II) chloride catalyst (11 mg, 0.016 mmol) and then diethyl(3-pyridyl) 
borane (265 mg, 1.8mmol) were added. To the resultant orange solution, an 
aqueous solution of sodium carbonate (2 M, 5 ml) was added. The flask was fitted 
with a reflux condenser and the apparatus was purged again with argon. The 
mixture was then heated under reflux (at ~80 °C) and stirred for 4 days then 
allowed to cool. The mixture was poured into water and extracted with hot toluene 
(3 X 30 ml). The toluene extracts were dried (using NaxCO3) and concentrated. 
Column chromatography was performed with Et,O and toluene (with a ratio of 
1:2) as the eluent to give abiraterone (350 mg, 66%) as a white crystalline solid: 
melting point 228-230 °C; IR O-H stretching frequency Vinax 3307 cm '; 'H NMR 
peak shifts in ppm (6) 1.07 (s, 3, H-19), 1,09 (s, 3, H-18), 3.54 (m, 1, H-3«), 5,41 
(dm, 1, J = 5.2 Hz, H-6), 6.01 (m, 1, H-16), 7.24 (dd, 1, pyridyl H-5), 7.66 (dd, 1, 
pyridyl H-4), 8.47 (dd, 1, pyridyl H-6), 8.63 (d, 1, pyridyl H-2); "CC NMR 8 151.69, 
147.92, 147.84, 141.19, 133.68, 132.98, 129.24, 123.03, 121.32, 71.65, 57.56, 50.36, 
47.34, 42.32, 37.19, 36.71, 35.26, 31.81, 31.64, 30.45, 20.88, 19.35, 16.59. The high 
resolution mass spectrum calculated the mass to charge ratio (m/z) C24H32NO 
[M+H]* to be 350.2484. The experimental value was 350.2491. Abiraterone was 
99% pure by liquid chromatography mass spectrometry. 

CYP17A1 design, expression and purification. The human CYP17A1 cDNA 
was synthesized with codon optimization for E. coli expression (Blue Heron 
Biotechnology). A truncated and His-tagged construct was generated by truncation 
of the N-terminal transmembrane helix (A2-19), substitution of Arg 20-Arg 21- 
Cys 22-Pro 23 with Ala 20-Lys 21-Lys 22-Thr 23, and addition of a C-terminal 
four-residue histidine tag (Supplementary Fig. 6). N-terminal modifications were 
designed to increase solubility. This altered cDNA was inserted into the pCWori* 
expression vector and expressed in E. coli JM109 cells. 

Cells were grown, collected and disrupted, as described previously'**. After 
centrifugation (5,000g), CYP17A1 was solubilized with either 4.8 mM Cymal-5 
(for crystallography; Affymetrix) or 2% Emulgen 913 (for assays; Desert 
Biologicals) and then underwent ultracentrifugation (80,000g) for 60 min. The 
lysate was loaded onto a nickel nitriloacetic acid agarose (Qiagen) column and 
purified, as reported elsewhere*’. Eluted CYP17A1 fractions were pooled, diluted 
fivefold with buffer A (50mM Tris, pH 7.4, 20% glycerol, 100mM glycine and 
1mM EDTA) and loaded on a HiTrap CM fast flow column (GE Healthcare). 
Protein was eluted in buffer A with 0.5 M NaCl, concentrated to ~1 ml, and loaded 
on a Superdex 200 16/60 size exclusion column (GE Healthcare). For crystal- 
lography, 10 1M abiraterone or TOK-001 (Shanghai Haoyuan Chemexpress) 
was added to all buffers. 

Protein crystallization, data collection and structure determination. CYP17A1 
crystals were grown using hanging-drop vapour diffusion equilibration. CYP17A1 
(30mg ml’) in buffer A supplemented with 101M abiraterone or TOK-001, 
2.4mM Cymal-5 and 0.5M NaCl was equilibrated against 30% PEG 3350, 
0.175M Tris, pH 8.5, 0.30M ammonium sulphate and 3% glycerol at 20°C. 
Crystals were cryoprotected in 7:1 mother liquor:ethylene glycol and flash cooled 
in liquid nitrogen. Native data sets were collected at 0.98 A, 100 K at the Stanford 
Synchrotron Radiation Laboratory beamline 9-2. Data were processed using 
Mosflm*' and Scala*’. The abiraterone complex was solved by molecular replace- 
ment using BALBES* with a final search model based on CYP2R1 (PDB 3CZH), 
and the TOK-001 structure was solved using Phaser” with the abiraterone structure 
as a search model. Model building and refinement were accomplished iteratively 
using COOT™ and Refmac5* in CCP4”*. Structure validation was performed using 
WHATCHECK” and PROCHECK”. Ramachandran plot analysis reveals per cent 
favourable/additional allowed/generously allowed/disallowed residues are 86.3/ 
13.2/0.5/0.0 (abiraterone structure) and 86.2/13.3/0.5/0.0 (TOK-001 structure). 
X-ray statistics are provided (Supplementary Table 1). Probe-occupied voids were 
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calculated using VOIDOO* (probe radius = 1.4 A; grid mesh = 0.4 A). All figures 
were prepared using MacPyMOL”. 
Docking. The CYP17A1 active site was defined as described for other cytochromes 
P450 (ref. 40) with the addition of an oxygen molecule directly coordinated to the 
haem to mimic compound I of the cytochrome P450 catalytic cycle. Substrate 
coordinates were prepared and energy was minimized with SYBYL (Tripos 
International). Charges were assigned using the Gasteiger and Marsili method. 
Surflex-Dock"' (Tripos International) was used to dock ligands as previously 
described**. The active site was a 10 A sphere around the haem and pregnenolone. 
Movement of pregnenolone within the active site was not substantial, with the 
distances from C17 to O= Fe(1v) as 4.5 A, from C16 to O=Fe(1v) as 4.5 A, and from 
C21 to O=Fe(1v) as 3.0 A for the lowest energy pose. 
Enzyme activity and IC;9 determinations. Progesterone 170-hydroxylation was 
evaluated using a modified HPLC method with ultraviolet detection’. CYP17A1 
(50 pmol) and rat NADPH-cytochrome P450 reductase“ were mixed with a ratio 
of 1:4, incubated on ice (20 min) and added to buffer (50 mM Tris, pH7.4 and 
5mM MgCl,) containing progesterone (0-50 1M) to a total volume of 500 pl. 
Phosphatidylcholine (25 ug) was included for side-by-side kinetic comparisons 
with the full-length enzyme'®. For ICs» determinations, inhibitor concentrations 
were 0-1.5 11M for abiraterone and 0-3.0 1M for TOK-001. After warming (37 °C, 
3 min), reactions were initiated by the addition of NADPH (20 ul 25 mM), incu- 
bated for 10 min (37°C), quenched with 20% trichloroacetic acid (300 ul) and 
placed on ice. The 17a-hydroxyprogesterone metabolite was identified by ultra- 
violet detection at 248 nm after HPLC separation and co-eluted with authentic 
standards. The HPLC mobile phase was 40% acetonitrile, 60% water with 1% 
acetic acid and run at 1 ml min | (Luna (Phenomenex), 5 jum, C18, 50 X 4.6 mm). 
Ligand binding assays. Ligand binding assays were based on spectral differences 
that were detected after ligand titration and were performed as described”*, except 
that the CYP17A1 concentration was 0.1 UM, the path length was 5 cm and the 
tight binding equation was used. 

Functional data were analysed using Prism (GraphPad Software) and presented 
as mean + standard error. 
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Matter with a high energy density (> 10” joules per cm’) is prevalent 
throughout the Universe, being present in all types of stars’ and 
towards the centre of the giant planets”’; it is also relevant for 
inertial confinement fusion*. Its thermodynamic and transport 
properties are challenging to measure, requiring the creation of 
sufficiently long-lived samples at homogeneous temperatures and 
densities**. With the advent of the Linac Coherent Light Source 
(LCLS) X-ray laser’, high-intensity radiation (>10'’ watts per 
cm”, previously the domain of optical lasers) can be produced at 
X-ray wavelengths. The interaction of single atoms with such intense 
X-rays has recently been investigated*. An understanding of the 
contrasting case of intense X-ray interaction with dense systems is 
important from a fundamental viewpoint and for applications. Here 
we report the experimental creation of a solid-density plasma at 
temperatures in excess of 10° kelvin on inertial-confinement time- 
scales using an X-ray free-electron laser. We discuss the pertinent 
physics of the intense X-ray-matter interactions, and illustrate the 
importance of electron-ion collisions. Detailed simulations of the 
interaction process conducted with a radiative-collisional code show 
good qualitative agreement with the experimental results. We 
obtain insights into the evolution of the charge state distribution 
of the system, the electron density and temperature, and the time- 
scales of collisional processes. Our results should inform future 
high-intensity X-ray experiments involving dense samples, such as 
X-ray diffractive imaging of biological systems, material science 
investigations, and the study of matter in extreme conditions. 

The experiment was performed at the LCLS soft X-ray materials 
science instrument (SXR), where a 1.0-um-thick aluminium (Al) foil 
was irradiated with 80-fs X-ray pulses at photon energies in the range 
1,560-1,830 eV (at and above the Al K edge). The LCLS pulse con- 
tained ~10'” photons with an energy bandwidth of ~0.4%. The X-ray 
pulse was focused by means of bendable Kirkpatrick-Baez mirrors, and 
the focal spot was characterized ex situ by analysing imprints in lead 
tungstate (ref. 9), indicating a spot size of 9.1 + 0.8 um’, that is, a peak 
intensity of 1.1 X 10'7 Wcm ”. Aluminium was studied because it is a 
prototypical free-electron metal, presenting all the intricacies of a high- 
electron-density system, yet possessing a relatively simple atomic 
structure. Our main target diagnostic is X-ray emission spectroscopy, 
sensitive to recombination from the L to the K shell, that is, to Al Ka 
emission, in the spectral range 1,460-1,680 eV. 

The absorption process in the Al foil is dominated by K-shell photo- 
absorption, ejecting a core electron into the continuum, because the 
cross-section for L-shell and valance-band photoionization is over a 
factor 10 smaller. After photoionization, the filling of the K-shell hole 
proceeds mainly by KLL Auger decay, producing an ion with two holes 
in the L shell. Radiative emission accounts for the remaining 3.8% of 


the total recombination from the L shell’®. In Fig. 1 we plot the spectrally 
resolved X-ray emission over a range of excitation photon energies 
between 1,560 and 1,830eV. The spectra show the main Ka peak 
around 1,487 eV, followed by a series of peaks corresponding to emis- 
sion from higher charge states due to a growing number of L-shell holes. 

The observation of Ka emission from highly charged ions in a dense 
system is not of itself novel, being routinely observed when charged- 
particle beams (such as ions or optical-laser-generated energetic elec- 
trons) interact with sold targets''"'’. However, our results differ from 
these experiments in two important ways. First, in our experiment the 
K-shell holes are created exclusively by intense, quasi- monochromatic 
X-ray photons rather than by collisions with broadband energetic 
particles. The main absorption process (creating K-shell holes) will 
therefore only take place in a particular ion if the photon energy lies 
above that ion’s K edge, or, given that the X-rays are so intense, ifa K 
electron is resonantly pumped to a vacant L-shell state. It follows that 
the X-ray free-electron laser (FEL) acts not only as a pump—exciting 
electrons and heating the system—but also as a selective probe, in that 
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Figure 1 | Spectrally resolved Ka, emission as a function of the X-ray FEL 
excitation photon energy. The colour coding (bar on right) refers to the 
emission intensity on a logarithmic scale. Roman numerals (top) indicate the 
charge state of the emission peak: red, for states with a single K-shell hole; blue, 
for states with a double K-shell hole. Peaks around the resonance line (dashed 
white line, indicating where the FEL photon energy equals the emitted photon 
energy) correspond to emission from resonantly-pumped K-L transitions. 
Open circles, K edges for the various charge states calculated in the SCFLY code, 
which includes the ionization potential depression in the dense plasma 
according to a modified version of the Stewart-Pyatt model”'”. 
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imi- K-shell photoionization 

mN- KLL Auger recombination 
(e)- L-shell collisional ionization 
igj- K-L resonant excitation 


Figure 2 | Experimental and theoretical spectra, 
and charge-state emission pathways. Left, 
calculated emission spectra (dashed lines) 


according to the SCFLY code for four X-ray 


energies (a, 1,830 eV; b, 1,750 eV; c, 1,660 eV; 

d, 1,580 eV), compared with the experimental 
results presented in Fig. 1 (solid lines). The two 
simulation curves correspond to different X-ray 
intensity distributions on target: a Gaussian profile 
and the measured F-scan profile. Also shown 
(right) are the various pathways leading to 
emission from an ion with five L-shell holes in the 
final state (VIII), observed as the fifth emission 
peak: these pathways depend strongly on the 
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wavelength of the X-ray excitation. Rx and Ryxy 
correspond to emission from resonant transitions 
between K- and L-shell states in an ion containing 
seven or eight L-shell holes, respectively (states X 
and XI). Colour bars below each plot on left 
indicate the dominant mechanism contributing to 
the emission intensity for the various charge states 
(Roman numerals). a.u., arbitrary units. 
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the K-shell holes and subsequent Ka emission only occur from a select 
range of ion charge states, depending on the FEL photon energy. As 
shown in Fig. 1, the range of ions from which we see K-shell emission is 
indeed strongly dependent on the energy of the incident photon: at 
photon energies just above the neutral K edge only a limited number of 
ionization stages is observed, whereas at the highest photon energy of 
1,830 eV we observe all possible ionization stages corresponding to the 
filling of a single K-shell hole. In addition, at specific photon pump 
energies, we observe emission from resonantly pumped single- and 
double-hole K-shell states, marked as resonance transitions in Fig. 1. 
We label all emission peaks according to their K- and L-shell charge 
state (the three M-shell electrons are free-electron-like in ground-state 
Al): that is, the main Ko peak corresponds to charge state IV and the 
higher energy peaks correspond to states in the range V-XI. The 
second important difference to note is that in our experiment the 
population of L-shell states, and the charge state distribution (CSD), 
is seen to be determined predominantly by electron-ion collisional 
processes within the heated dense system, rather than directly by the 
X-ray pulse. Therefore, provided the photon energy is sufficiently high 
to satisfy the probe condition described above for all charge states, the 
Ka emission spectrum contains information on the CSD of the hot 
system, on the electron temperature and on electron-ion collision 
rates (best observed in the case of resonances where not one but several 
emission peaks are observed at a single pump energy). 


2 | NATURE | VOL 000 | 00 MONTH 2012 


This pivotal role of collisions in determining the observed spectra 
and the CSD is evinced by two main features in the data. The first is the 
relatively uniform intensity of the Ka emission as a function of charge 
state. For isolated atoms, successive sequential photoionization of the 
K-shell occurs, with the dominant ionization pathway being a P-A-P- 
A... process (where P indicates photoionization, and A Auger decay)*. 
As each P-A sequence results in two electrons being ejected from the L 
shell, the populations of ions with an even number of ejected electrons 
dominate over those with an odd number ejected. In contrast, for the 
solid-density system, the dynamics are distinctly different, and we 
observe no odd-even asymmetry in the Ka emission. This is due to 
the high electron-ion collision rates, which compete with both the P 
and A processes. Our simulations (below) show that the collisional and 
Auger rates are comparable at the peak of the pulse at the highest 
intensities. The second piece of evidence for the important role of 
collisions is the observation of emission corresponding to resonantly 
pumped single and double-core-hole states in ion stages for which 
direct photo-ionization of a K-shell electron is energetically forbidden. 
As the number of electrons in the L shell decreases, the reduced screen- 
ing results in tighter binding of the K electrons, increasing the K-edge 
energy as a function of ion stage (see Fig. 1). Thus, for a particular 
photon energy, the ejection of a K electron is only possible up to a 
certain ion stage, explaining the observed fundamental trend where Ko 
emission is seen from higher charge states as the photon energy is 
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increased. Nevertheless, this principle can be violated, as the density is 
so high that additional collisional ionization occurs, promoting further 
L electrons to the continuum (in addition to those ionized by Auger 
recombination). Such collisional ionization takes place for ions with 
two K electrons present and, importantly, is also sufficiently rapid that 
it can produce further L-shell vacancies in ions already containing a 
K-shell hole (that is, after photoabsorption, yet before Auger or 
radiative decay). In the former case, ions with K-edge energies higher 
than the incident photon energy can be created, but further photo- 
excitation of K-shell electrons to the continuum for these ions is no 
longer possible, as the K edge exceeds the photon energy. In the latter 
case, two interesting effects arise. 

First, some Ka emission is still seen from ion stages which have 
K-edge energies higher than the photon energy, via a process whereby 
photo-excitation of the K shell occurs at a lower ion stage, collisions 
ionize further L-shell electrons in the presence of the K-shell hole, then 
radiative decay occurs producing Ka emission. This effect can be seen 
in Fig. 1 by comparing the spectrum for a given LCLS photon energy 
with the calculated energy of the K edge for a particular ion stage. 
Although accurate independent measurements of the K-edge positions 
in these hot-dense conditions are not available (we note that our results 
provide some information in this regard, albeit blurred by strong 
collisional effects), forcing us to rely on computational modelling, 
we note that in order to explain the spectra in terms of a K-edge shift 
alone in the absence of collisional ionization, the calculated shifts 
would need to be incorrect by several hundred electron volts for the 
highest charge states. Such a large discrepancy is implausible, but more 
importantly, would also lead to severely inconsistent simulation results 
for the emission spectrum. We further note that this observation also 
cannot be explained by multi-photon ionization, which scales with the 
square of wavelength, and is negligible even at these intensities in the 
X-ray regime. 

Second, resonant photo-pumping of K-shell electrons to the L shell 
(rather than to the continuum) is energetically permitted for ions with 
both a single and a double K-shell hole in the final state. Radiative 
decay of these states produces the resonance emission highlighted in 
Fig. 1. Such emission appears prominently for states with two holes in 
the K-shell (marked as V-X for X-ray excitation in the range 1,600- 
1,680 eV), as well as from highly ionized states with a single K-shell 
hole (emission from peaks X and XI at excitation energies around 
1,580 eV). In particular, we note that several peaks are observed on 
resonance rather than just one, indicating that the collisional rates are 
similar to, or larger than, Auger recombination rates. 

To further interpret our results, we have performed time-dependent 
population kinetics calculations, using the SCFLY code’*. These 
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simulations provide atomic level populations, CSDs, emission spectra, 
total absorbed energies and electron temperatures. The calculations 
are based on a rate-equation model that includes sequential single- 
photon absorption, spontaneous and stimulated emission, collisional 
excitation, ionization and their inverses, as well as the effects of opacity 
via an escape-factor formalism. We plot the calculated spectra for four 
X-ray excitation energies compared with our experimental measure- 
ments in Fig. 2. The interaction processes leading to the observed 
emission from a specific charge state (VIII) are shown alongside for 
the four X-ray photon energies, illustrating the different pathways 
resulting from a combination of direct K-shell photoionization, elec- 
tron collisional ionization and resonant K-L transitions. We note that 
the emission intensity from the various charge states is not trivially 
related to the population of those charge states. 

The simulated spectra successfully reproduce the experimental 
trends in a qualitative manner, however, quantitative differences can 
still be observed for several charge state emission intensities. This is in 
part due to uncertainties in the experimental intensity distribution on 
target, which is challenging to accurately account for (illustrated by the 
two simulation curves in Fig. 2, corresponding to different spatial 
profiles), in part to the simplifications made necessary for an efficient 
atomic-kinetics code, but mostly to uncertainties regarding fun- 
damental physical parameters in warm and hot dense systems, such 
as the rates for inner-shell transitions, collisional ionization, Auger 
recombination in highly ionized hollow atoms, and the ionization 
potential depression. This last point regarding physical parameters is 
particularly acute in the case of emission from double-core-hole states, 
where the agreement between experiment and simulation is poorest. 
Nonetheless, given the qualitative agreement with the spectra and the 
straightforward energy absorption mechanism, the combined effect of 
these uncertainties on the basic thermodynamic quantities is not 
expected to be significant. The calculated electron temperature, free- 
electron density and CSD are shown in Fig. 3. Temperatures in excess 
of 100 eV are found, which do not depend strongly on the excitation 
wavelength. Further, higher X-ray photon energies invariably yield 
higher temperatures by the end of the pulse, despite having an initially 
lower absorption cross-section. This can be explained by thermal col- 
lisions governing the CSD: as the temperature rises, low charge states 
are depleted and higher-energy X-rays, which couple more efficiently 
to higher charge state transitions, deposit more energy in the system. 
To quantitatively evaluate this collisional effect, we illustrate in Fig. 4 
the photoionization, Auger recombination and collisional ionization 
rates as a function of time for transitions populating and de-populating 
the single-core-hole charge state VII. We observe that by the peak of 
the pulse, at temperatures above 100eV, electron-ion collisional 


Figure 3 | Temperature, density and charge state 


distribution at peak intensity. a, Calculated 
charge state distribution, integrated over the 
duration of a temporally Gaussian pulse. The 
calculations correspond to the centre of the X-ray 
spot, at the highest spatial intensities. b, c, The 
temperature (b) and density (c) of the electronic 
system over a range of excitation energies. The peak 
of the X-ray pulse is centred at 80 fs, with a FWHM 
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Figure 4 | Rates for atomic and collisional processes. Comparison between 
K-shell photoionization, collisional ionization and Auger recombination rates 
(broken curves, left-hand vertical axis) involved in populating and depopulating 
charge state VII at an X-ray photon energy of 1,830 eV. This state is one of the 
most populated for conditions leading to a peak temperature of ~100 eV. The 
calculation is conducted with a Gaussian beam (80 fs FWHM) centred at 80 fs. 
The electron temperature is given by the solid curve with grey shading under 
(right-hand vertical axis). By the peak of the pulse at 80 fs, the collisional 
ionization rate exceeds all others, dominating the population dynamics. 


processes dominate the evolution of the CSD, and the dynamics of the 
system is determined by its temperature rather than by the intense 
X-ray excitation field. This result is fully consistent with our experi- 
mental results shown in Fig. 1. 

Finally, in the context of creating well-defined states of matter in 
extreme conditions, we note that the duration of the pulse, heating of 
the electrons, photoionization and Auger decay are all rapid processes 
compared with atomic motion and hydrodynamic expansion of the 
target. For example, an aluminium ion at 150 eV travels only 5 nm in 
100 fs, and thus a crude estimate of the hydrodynamic expansion time 
would be 20 ps for a 1 um target. Given that electron-ion relaxation 
times are expected to be less than 10 ps (ref. 19), these results bode well 
for the creation of uniform-density hot and warm dense matter. 


METHODS SUMMARY 


A schematic illustration of the experimental set-up is given in Supplementary Fig. 1. 
The spectroscopy was conducted by means of a flat ammonium dihydrogen 
phosphate crystal (ADP) in the (101) orientation with a spectral resolution of 
0.9-1.3 eV across the photon energy range of interest. All spectroscopic data were 
collected in single-shot acquisition mode, with a fresh region of the target exposed 
for every shot. Figure 1 was obtained by binning a total of around 550 single-shot 
spectra according to the measured X-ray photon energy in 5-eV bins. 

Simulations of the experimental emission were conducted using an adapted 
version of the collisional-radiative code SCFLY'*, which is based on the widely 
available FLYCHK simulation suite. SCFLY solves the rate equations for non- 
LTE (local thermal equilibrium) plasmas using a super-configuration description 
of the atomic structure of the system and provides a spectral module, which uses 
the computed level populations to calculate the X-ray emission spectra. Several 
important improvements have been included to make SCFLY applicable to X-ray 
FEL-driven systems: an increased number of super-configurations, the inclusion 
of hollow atoms and a self-consistent, absorption-driven temperature calculation. 
Each calculation provides an emission spectrum for a single X-ray intensity. To 
account for the experimental intensity distribution on target, 26 simulations 
covering the full range of probed intensities were computed for each X-ray excita- 
tion energy. The final spectra are obtained by averaging these over the three- 
dimensional interaction region, weighted according to the intensity distribution 
profile measured by the imprints technique in lead tungstate (PbWO,). This 
intensity distribution, named the F-scan profile, which is obtained following the 
method described in ref. 9, is compared to the Gaussian intensity distribution 
consistent with the measured spot size (full-width at half-maximum, FWHM) in 
Fig. 2. 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Experimental set-up. During the experiment, the total pulse energy was recorded 
on a shot-to-shot basis using a gas detector in the LCLS front-end enclosure. We 
measured an average pulse energy of 0.8-1.4mJ, depending on the photon 
wavelength. Approximately 30-34% of this energy is delivered to the target owing 
to the transmission of the beamline optics between the gas detector and the 
Resonant Coherent Imaging (RCI) experimental chamber’’. During the course 
of the experiment, we scanned a photon energy range of 1,580-1,830 eV. The 
photon energy values are derived from the electron beam energies provided by the 
LCLS online diagnostics, the values of which were calibrated at the Al K edge both 
before and after the data acquisition. This method provides an accuracy of better 
than 5 eV in the absolute calibration, and is approximately constant over the entire 
range of energies investigated. 

Data analysis. The noise filtering for the data analysis was conducted by analysing 
the number of counts per CCD (charge-coupled device) pixel: photons of 1,600 eV 
are seen to produce approximately 150 counts above the thermal noise level on the 
CCD detector, which is sufficient to isolate the signal and reduce the noise con- 
siderably. Any change in beam position on the sample will alter the crystal diffraction 
angle slightly, causing a shift in where wavelengths are mapped on to the detector. We 
have accounted for this positioning jitter effect by cross-correlating each single shot 
with the average spectrum and adjusting the positions of the Ko satellite peaks 
accordingly to overlap. The wavelength axis of the final spectrum is then calibrated 
by fitting the positions of the main Ko peak and its satellites to previously published 
values’’. This eliminates the need to use geometric considerations to map the detector 
pixels to wavelength and is considerably more accurate. We did not measure the 
temporal profile of the LCLS beam or the pulse length. Instead, the LCLS electron 
bunch length was measured (80 fs FWHM) and assumed to be a good indication of 
the X-ray pulse duration. However, this value should only be considered as an upper 
bound on the pulse duration, with recent experiments reporting results consistent 
with 2-3 times shorter pulses*”*. 
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Modelling. Simulations of the experimental emission were conducted using the 
SCELY code'*” adapted to simulate FEL-driven systems. It has recently been 
reported that these modifications enable SCFLY to successfully simulate X-ray- 
matter interactions in the non-collisional regime limit”. For the work presented 
here, which extends this study to the highly collisional regime, a complete set of 
super-configurations for all ionic species of aluminium up to the n = 3 shell are 
considered in the code. We have further verified that the spectrum exhibits a 
negligible dependence on the temporal structure of the X-ray pulse, provided 
the total fluence is constant, by comparing calculated spectra obtained by using 
a Gaussian temporal profile with a simulated self-amplified spontaneous emission 
FEL pulse of equal duration and fluence, following ref. 26. 

The energies used in the simulations are calculated according to the relativistic 
configuration average model following a Dirac-Hartree-Slater method”. The 
agreement in peak position that can be observed between simulation and experi- 
ment, within ~3 eV (0.2%) for the highest charge states, was deemed sufficient 
within the context of our investigation. It can however be improved on by using a 
fine structure model for the energy levels, albeit at a significant additional com- 
putational cost. 
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Gated regulation of CRAC channel ion selectivity 


by STIMI1 


Beth A. McNally', Agila Somasundaram', Megumi Yamashita’ & Murali Prakriya' 


Two defining functional features of ion channels are ion selectivity 
and channel gating. Ion selectivity is generally considered an 
immutable property of the open channel structure, whereas gating 
involves transitions between open and closed channel states, 
typically without changes in ion selectivity’. In store-operated 
Ca** release-activated Ca?* (CRAC) channels, the molecular 
mechanism of channel gating by the CRAC channel activator, 
stromal interaction molecule 1 (STIM1), remains unknown. 
CRAC channels are distinguished by a very high Ca’* selectivity 
and are instrumental in generating sustained intracellular calcium 
concentration elevations that are necessary for gene expression and 
effector function in many eukaryotic cells’. Here we probe the 
central features of the STIMI1 gating mechanism in the human 
CRAC channel protein, ORAII, and identify V102, a residue 
located in the extracellular region of the pore, as a candidate for 
the channel gate. Mutations at V102 produce constitutively active 
CRAC channels that are open even in the absence of STIM1. 
Unexpectedly, although STIM1-free V102 mutant channels are 
not Ca**-selective, their Ca’* selectivity is dose-dependently 
boosted by interactions with STIM1. Similar enhancement of 
Ca’* selectivity is also seen in wild-type ORAI1 channels by 
increasing the number of STIM1 activation domains that are 
directly tethered to ORAII channels, or by increasing the relative 
expression of full-length STIM1. Thus, exquisite Ca”* selectivity is 
not an intrinsic property of CRAC channels but rather a tuneable 
feature that is bestowed on otherwise non-selective ORAI1 channels 
by STIM1. Our results demonstrate that STIM1-mediated gating of 
CRAC channels occurs through an unusual mechanism in which 
permeation and gating are closely coupled. 

Functional CRAC channels are tetramers of ORAI1 subunits*°, 
with the pore flanked by residues of the first transmembrane domain 
(TM1) of each subunit®’ (Fig. 1a). To localize the gate region that 
governs STIM1-dependent activation, we mutated individual pore- 
lining residues to Cys and analysed state-dependent differences in 
the sensitivity of mutant channels to methanethiosulphonate (MTS) 
Cys-reactive reagents®*. Because the unusually narrow CRAC channel 
pore”’® prevents entry of the relatively large MTS reagents®, we per- 
formed these studies in the E106D ORAII mutant, which has a wider 
pore yet maintains store-dependent activation’. With this as a back- 
ground, several TM1 pore-lining residues, including V102C and 
G98C, became accessible to the small MTS reagent, MTSEA, with 
G98C showing particularly strong sensitivity to this reagent (Fig. 1 
and Supplementary Fig. 1). Inhibition of G98C by MTSEA could be 
protected by La** (Fig. 1b, d), which blocks CRAC channels by bind- 
ing to residues in the outer vestibule’, and this is consistent with 
modification of G98C occurring from within the pore. 

To determine differences in MTSEA accessibility between closed 
and open channels, we quantified the relief of MTSEA blockade that 
was elicited by the reducing agent bis(2-mercaptoethyl)sulphone 
(BMS)°. Resting cells were exposed to MTSEA for 100-120 s and 
subsequently, CRAC current (crac) was activated by passive store 
depletion (Fig. 1c). These experiments indicated that modification of 


G98C was profoundly state-dependent, with no modification occurring 
in closed channels (Fig. 1c, e). By contrast, D110C, a pore-lining residue 
located in the outer vestibule, was modified to similar extents in both 
closed and open states (Fig. le). The most straightforward explanation 
for this result is that the closed channel conformation prevents access of 
MTSEA to G98C, suggesting that the gate is located externally to G98 
but below D110. As the key pore-lining TM1 residues in this region are 
E106 and V102, the gated access of G98C implicates these residues as 
potential candidates for the gate. E106 controls Ca" selectivity'!"? and 
is not thought to regulate store-operated gating"®, leaving V102 as the 
most promising residue for further study. 

Previous reports suggest that V 102 is located very close to the central 
symmetry axis of the channel®”, that is, in a narrow constriction of the 
pore. If V102 is a component of the gating mechanism, mutations at 
this locus would be predicted to destabilize channel gating. Consistent 
with this possibility, a Cys mutation of V102 eliminated store-dependent 
gating. Cells expressing V102C ORAII and STIM1 displayed a large 
standing Icrac after whole-cell break-in (Fig. 2a). Moreover, resting 
cells exhibited constitutive Ca** entry and activation of the Ca**- 
dependent nuclear factor of activated T cell (NFAT) transcription 
factor (Supplementary Fig. 2), indicating that V102C ORAI1 channels 
are constitutively active. 

Several lines of evidence indicated that the constitutive activation of 
V102C ORAII is STIM1-independent. Large La**-sensitive standing 
currents were seen in cells expressing V102C ORAII alone (Fig. 2b). 
Furthermore, Ca”* imaging and NFAT activation experiments 
revealed constitutive Ca”* entry in these cells (Fig. 2c and Supplemen- 
tary Fig. 2c). Recent evidence indicates that STIM1 drives the redis- 
tribution of ORAI1 into discrete puncta after endoplasmic reticulum 
(ER) store depletion’. However, when expressed alone, V102C ORAI1 
remained diffusely distributed in the plasma membrane (Fig. 2d). 
Moreover, Icrac in these cells did not show Ca** -dependent fast in- 
activation (CDI) (Supplementary Fig. 3). Because puncta formation and 
CDI require STIM1 (ref. 14-16), these results indicate that when over- 
expressed alone, the mutant channels are functionally free of STIM1. 
Consistent with this interpretation, knockdown of endogenous STIM1 
in HEK293 cells did not affect the constitutive V102C current (Sup- 
plementary Table 1). However, when V102C ORAII was co-expressed 
with STIM1, puncta formation, interaction with STIM1, and CDI were 
indistinguishable from the behaviour of wild-type ORAII (Fig. 2d and 
Supplementary Figs 3 and 4). Additional analyses indicated that intro- 
ducing the mutations E106A or R91W, which abrogate store-operated 
ORAII activity’*’*"’, strongly diminished V102C ORAII currents 
(Supplementary Fig. 5a), indicating that these residues are essential 
for both store-operated and constitutive activation modes of ORAII. 
Mutation of the equivalent residue in Orai3 (V77C) also resulted in a 
STIM1-independent activation phenotype similar to that seen in 
V102C ORAII (Supplementary Fig. 6). Together, these results indicate 
that the V102C mutation destabilizes the channel gate, resulting in 
STIM1-independent constitutive ORAI] activation. 

Many ion channels including nicotinic acetylcholine receptors and 
the mechanosensitive channel MscL are reported to use hydrophobic 
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Figure 1 | State-dependent accessibility of pore- 
lining residues localizes the activation gate to the 
extracellular TM1 region. a, Schematic 
representation of the key pore-lining residues in 
ORAII (refs 6, 7). bs MTSEA modification of G98C 
is protected by La®’. A HEK293 cell co-expressing 
G98C/E106D ORAI1 (O1) and STIM1 was 
exposed to two applications of MTSEA (100 UM), 
the first in the presence of La** (100 uM) and the 
second after washout of La® ". Periodic applications 
of a divalent free (DVF, red rectangles) solution 
facilitated washout of La**. MTSEA inhibition 
was quantified by the relief of block induced by 
BMS (5 mM) (arrows). c, State-dependent 
modification of G98C. MTSEA (200 uM) was 
applied for 120s to resting cells and then washed 
off. After whole-cell break-in, Icrac was activated 
by passive store depletion by dialysing in BAPTA. 
BMS was applied to examine relief from MTSEA 
blockade (arrows). A second application of 
MTSEA and BMS provides a measure of blockade 
in open channels. A DVF solution was periodically 
applied to monitor Na -Icrac. d, Summary of 
MTSEA blockade of open G98C/E106D ORAIT]I in 
the presence and absence of La**.e, Summary of 
blockade of G98C/E106D and D110C ORAI1 by 
MTSEA in closed and open channels. Values are 
mean + s.e.m. 


Figure 2 | Mutations at V102 cause STIM1- 
independent constitutive ORAI1 activation. 

a, Time course of the development of Icrac in cells 
expressing wild-type (WT) or V102C ORAI] and 
STIM1 after whole-cell break-in. Intracellular Ca”* 
stores were depleted by dialysing cells with 8 mM 
BAPTA. b, V102C ORAI]I currents are 
constitutively active in the absence of STIM1 co- 
expression. c, Intracellular calcium concentration 
[Ca?*],; measurements in HEK293 cells expressing 
the indicated ORAI1 constructs in the absence of 
STIM1. UT, untransfected. d, Localization of 
V102C ORAI1-cyan fluorescent protein (CFP) 
before, and after, ER Ca”* store depletion in the 
absence (left) or presence (right) of STIM1-yellow 
fluorescent protein (YFP). e, Mutational analysis of 
V102. Normalized current densities of V102 
substitutions plotted against the solvation energies 
of the substituted amino acids” in the presence or 
absence of STIM1 co-expression. Currents were 
normalized to the mutant that yielded the maximal 
current density for each condition (Ala for STIM1- 
free cells and Ile in STIM1-co-expressing cells). 
Green, mutants that yield large constitutively active 
currents in the absence of STIM1]1; red, mutants that 
are not constitutively active but that require STIM1 
for activation. TG, thapsigargin. 


residues (Leu, Val and Ile) as gates to inhibit the flux of ions'**°. To 
explore the possibility that V102 comprises a hydrophobic gate in 
ORAI]I, we investigated the side-chain dependence at this position 
for constitutive activation. We observed constitutive activity with several 
mildly hydrophobic and polar substitutions, including Cys, Gly, Ala, Ser 
and Thr (Fig. 2e). Conversely, substitutions to the highly hydrophobic 
amino acids Leu, Ile and Met resulted in only STIM1-dependent activa- 
tion, as seen in wild-type ORAI1. Large hydrophobic residues such as 
Trp, Tyr and Phe attenuated both the constitutive and STIM1-induced 
currents, probably because of pore occlusion, as expected for a position 
that is nestled in a narrow region of the pore®’ (Supplementary Fig. 5b). 
Substitutions to extremely polar residues such as Glu, Asp, Lys and Arg 
resulted in non-functional channels with or without STIM1, probably 
owing to secondary effects of these mutations on the nearby selectivity 
filter at E106. Despite these deviations, however, the overall pattern is 
consistent with the hypothesis that V102 comprises a hydrophobic gate, 
with less hydrophobic substitutions producing a leaky gate. 

CRAC channels are extraordinarily Ca** selective and poorly per- 
meable to the large monovalent cation Cs*. However, the ion selectivity 
of STIM1-free V102C ORAII channels differed from wild-type ORAI1 
channels in both respects. STIM1-free V102C ORAII channels showed 
significantly lower Ca** permeability (P<0.0001), as demonstrated by 
the left-shifted reversal potentials of mutant Ca’* currents (Fig. 3a). 
Consistent with this interpretation, replacement of extracellular Na* 
with NMDG‘*, an impermeant ion, revealed significant Na‘ conduc- 
tion in these channels (P<0.0001) (Fig. 3b). Direct estimates of 
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Figure 3 | STIM1 regulates ion selectivity of constitutively active V102C 
ORAII channels. a, Current-voltage (I-V) relationships of V102C ORAI1 
currents in 20mM Ca** and DVF Ringer’s solutions. Arrows emphasize the 
reversal potential (V;ey) in each case. The bar graphs summarize the V;ey in the 
presence or absence of STIM1. Values are mean = s.e.m. siSTIM1, short 
interfering RNA targeting STIM1. b, Effects of substituting extracellular Na* 
with NMDG* on V102C ORAII currents in the absence or presence of STIMI. 
c, Effects of replacing the standard extracellular Ringer’s solution with Na‘ - or 
Cs* -based DVE solutions. In the absence of STIM1, large Cs* currents are seen 
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fractional Ca”* currents using fluo-4 indicated that V102C ORAI1 
conducts only 36% of the Ca** carried by wild-type ORAI1 in 
20mM Ca** (Supplementary Fig. 7a). In addition, unlike wild-type 
channels, STIM1-free V102C channels were highly permeable to the 
large monovalent cation Cs* (Fig. 3a-c and Supplementary Table 1). 

Unexpectedly, co-expressing exogenous STIM1 together with 
V102C ORAII enhanced the Ca** permeability and lowered the 
Cs* permeability of V102C ORAII, effectively correcting its aberrant 
ion selectivity (Fig. 3a—c). STIM]1 also modified permeation of V102C 
ORAII for Ba** and Sr** (Supplementary Fig. 7b). Modification of ion 
selectivity by STIM1 was not unique to V102C ORAII but occurred in 
all constitutively active V102X mutants (Supplementary Table 1). 
These changes in ion selectivity required direct STIM1-ORAI]1 inter- 
actions, as modification of V102C ORAL] ion selectivity was nullified 
in the V102C/L276D ORAI1 double mutant (Fig. 3d), in which 
STIM1-ORAI1 binding was impaired (Supplementary Fig. 4b, c)”’. 
STIM1-free and STIM1-bound V102C channels also displayed very 
different minimal pore widths (Fig. 3e), directly demonstrating that 
STIM1 alters the pore structure of V102C channels. Further analyses, 
using concatenated ORAII dimers (see Supplementary Methods) 
indicated that the subunit stoichiometry of STIM1-free and STIM1- 
bound mutant channels are identical, arguing that their distinct 
permeation properties are due to different pore structures of fully 
assembled, tetrameric channels rather than different subunit stoichio- 
metries (Supplementary Fig. 8). Collectively, these results indicate that 
STIM1 binding modifies the structural features of the mutant channel 
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in V102C ORAII channels. By contrast, no Cs* conduction is observed in the 
presence of STIM1. d, I-V relationship of currents in the V102C/L276D ORAI1 
double mutant in the presence or absence of STIM1. The bar graphs summarize 
the V,ey values of this mutant in the presence or absence of STIM1. Values are 
mean + s.e.m. e, Relative permeabilities of V102C ORAI] channels to different 
organic monovalent cations plotted against the size of each cation (test ion Px 
and Na* Py,). Dotted lines are fits to the hydrodynamic relationship. Values of 
dpore (the apparent width of the pore) estimated from the fits are 4.9 A for 

V102C ORAI1 + STIM1 channels and 6.9 A for V102C ORAI]1 channels. 
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pore, bestowing permeation properties that are associated with CRAC 
channels. 

The normalization of ion selectivity of mutant channels by STIM1 
suggests that their altered ion selectivity in the absence of STIM1 is not 
merely a byproduct of the mutations, but that it is indicative of a native 
intermediate activation state that is not readily seen in wild-type channels 
owing to a closed gate. Recent studies indicate that ORAI1 channels 
are activated in a nonlinear and cooperative manner by STIM1, with 
maximal channel activation requiring binding of eight STIM1 
molecules per channel’*”*”’. We reasoned that if modification of ion 
selectivity is coupled to the stoichiometry of STIM1 binding, then 
partial activation of wild-type ORAI1 by a sub-saturating concentra- 
tion of STIM1 may lead to incomplete normalization of ion selectivity, 
revealing an intermediate activation state. To test this hypothesis, we 
used constructs in which wild-type ORAI1 was tethered to either one 
or two functional STIM1 (S or SS) domains (resulting in four or eight S 
domains per channel, respectively), as recently described”*. We found 
that wild-type ORAI1 channels tethered to one S domain per subunit 
produced currents that were smaller and displayed diminished CDI 
compared to ORAI1-SS channels (Supplementary Fig. 9a), as expected 
from the known requirement of STIM1 for CRAC channel activation 
and inactivation’***”*, Importantly, reversal potential measurements 
and ion substitution experiments indicated that unlike ORAI1-SS 
channels, ORAI1-S channels exhibited diminished Ca** and enhanced 
Cs* selectivity (Fig. 4a,b). Similar alterations in ion selectivity were also 
seen when wild-type ORAI1 was expressed with limiting concentra- 
tions of full-length STIM1 (Supplementary Fig. 9b). These results 
support the hypothesis that the V102 mutations stabilize an inter- 
mediate channel activation state. To gauge the dose dependence of 
ion selectivity modulation by STIM1, we examined reversal potentials 
of V102C and wild-type channels tagged to zero, one or two S domains 
(Fig. 4c, d). Despite the different starting functional states of wild- 
type (closed) and V102C (open) channels, STIM1 caused similar, 


dose-dependent alterations in ion selectivity in both cases, while con- 
comitantly enhancing activation of wild-type channels (Fig. 4d). Thus, 
ORAII channel activation and changes in ion selectivity probably result 
from the same underlying energetic changes that are driven by STIM1 
binding. 

Collectively, our results show that mutations at V102 cause con- 
stitutive activation of ORAII channels through a mechanism that 
probably involves destabilization of the channel gate at V102. This 
disposition of the STIM1 activation gate, in the extracellular region 
of the pore close to the selectivity filter (E106), is markedly different 
from the familiar structural designs in K* channels and by extension, 
voltage-gated. Ca?* (Ca,) channels, which are constructed with the 
gate located at the cytoplasmic end of the pore**. We exploited the 
constitutive channel activity that resulted from mutations in the 
putative gate to identify an unusual ion channel gating mode in which 
STIM1 regulates ion selectivity and the pore architecture of CRAC 
channels. Activation by STIM1 bestows several key distinctive char- 
acteristics that are associated with CRAC channels including high 
Ca’* selectivity, low Cs* permeability and a narrow pore to otherwise 
non-selective ORAI1 channels. Although the underlying mechanism 
remains to be established, the close proximity of the putative STIM1 
activation gate (V102) to the selectivity filter (E106) probably contri- 
butes to the tight coupling of permeation and gating during channel 
activation. The altered ion selectivity of ORAI] channels when STIM1 
is limiting is reminiscent of the ion selectivity of ORAI1 and ORAI3 
channels directly activated by the compound 2-APB, which exhibit 
lower Ca** selectivity and higher Cs* selectivity than STIMI- 
activated ORAI channels”. These findings indicate that the exquisite 
Ca’* selectivity of CRAC channels is not an intrinsic and immutable 
property of ORAI! but is instead uniquely manifested only in response 
to STIM1 gating. Given the emerging evidence that indicates that 
ORAII can be activated in a STIM1-independent manner by other 
cellular activators”, these results raise the possibility that activation of 
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highly Ca** selective or non-selective ORAII currents may bea general 
mechanism for cells to tune Ca** and Na* entry through ORAII 
channels depending on the nature of the upstream activation signal. 


METHODS SUMMARY 

Tcrac was recorded in the standard whole-cell patch-clamp configuration in 
HEK293 cells transfected with the indicated ORAI1 mutants, which were cloned 
into a bicistronic expression vector that co-expressed GFP. For recording Icracs 
the membrane potential was hyperpolarized from +30 mV (holding) to —100 mV 
(100 ms) and then ramped from —100 mV to +100 mV (100 ms). MTS reagents 
were added to a 20 mM Ringer’s or divalent-free Ringer’s solution at the indicated 
concentrations. Second-order rate constants of MTSEA blockade were determined 
at a constant holding potential of —80 mV, as previously described’. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cells. HEK293 cells were maintained in suspension in a medium containing 
CD293 supplemented with 4 mM GlutaMAX (Invitrogen) at 37°C, 5% COs. 
For imaging and electrophysiology, cells were plated and adhered to poly-1- 
lysine-coated coverslips at the time of passage, and grown in a medium containing 
44% DMEM (Mediatech), 44% Ham’s F12 (Mediatech), 10% fetal calf serum 
(HyClone), 2 mM glutamine, 50 U ml penicillin and 50 jig ml’ streptomycin. 
Plasmids and transfections. Cys mutations that were used for characterization 
using electrophysiology were engineered into the previously described C-terminal 
Myc-tagged ORAI1 construct (MO70 ORAI]) in the bicistronic expression vector 
pMSCV-CITE-eGFP-PGK-Puro”’. For generating the tandem dimers, the basic 
‘building blocks’ (monomeric ORAI1 (MO7O) and the Bluescript SK* vector with 
ORAII attached to a linker) were obtained from T. Shuttleworth and constructed 
as previously described’. The orientation and number of subunits in the final 
constructs were confirmed by restriction enzyme analysis and western blot analysis. 
The ORAI1-S-eGFP and ORAI1-SS-eGFP constructs were kind gifts of T. Xu. The 
ORAI1-CFP, STIM1-YFP, and CFP-ORAI3 plasmids have been previously 
described*'**. Site-directed mutagenesis to generate ORAI1 mutations was per- 
formed using the QuikChange Site- Directed Mutagenesis Kit (Stratagene) according 
to manufacturer’s instructions and the results were confirmed by DNA sequencing. 
For electrophysiology, the indicated ORAI1 constructs were transfected into 
HEK293 cells either alone or together with a construct expressing unlabelled 
STIMI1 (pCMV6-XLS, Origene Technologies). Cells were transfected with the indi- 
cated STIM1 and/or ORAI1 complementary DNA (ratio of 10:1 by mass for elec- 
trophysiology) using TransPass D2 (NEB Labs) and studied 24h later. Cells that 
were transfected with siSTIM1 (Ambion) were studied 72 h following transfection. 
Electrophysiology. Currents were recorded in the standard whole-cell configura- 
tion at room temperature on an Axopatch 200B amplifier (Molecular Devices) 
interfaced to an ITC-18 input-output board (Instrutech). Routines developed by 
R. S. Lewis and M. Prakriya on the Igor Pro software (Wavemetrics) were used for 
stimulation, data acquisition and data analysis. Data are corrected for the liquid 
junction potential of the pipette solution relative to Ringer’s in the bath (—10 mV). 
The holding potential was +30 mV. The standard voltage stimulus consisted of a 
100-ms step hyperpolarization to -100 mV followed by a 100-ms ramp depolariza- 
tion from -100 to +100 mV, applied at 1-s intervals. Unless noted otherwise, the 
peak currents during the -100 mV pulse were measured for data analysis. For 
examining Ca**-dependent fast inactivation, the voltage protocol consisted of a 
300-ms step decrease to -100 mV, applied at 2-s intervals. Unless otherwise indi- 
cated, Icrac was activated by passive depletion of intracellular Ca** stores by 
internal dialysis of 8 mM BAPTA through the pipette solution. To prevent com- 
plications arising from the changing membrane potential in the standard 
step-ramp voltage protocol, rate constants of blockade by MTS reagents were deter- 
mined at a constant potential of -80 mV by acquiring 200-ms sweeps of current at 
4 Hz. All currents were acquired at 5 kHz and low-pass filtered with a 1-kHz Bessel 
filter built into the amplifier. All data were corrected for liquid junction potential of 
the pipette solution and for leak currents collected in 50-150 tM LaCl. 

MTS reagent protocol. The protocol for analysis of state-dependent modification 
of ORAI] bearing Cys mutants is described in Fig. 1c. For the D110C mutant, this 
protocol was slightly modified to adjust for the formation of spontaneous disulphide 
bonds in this mutant®. The protocol included an additional application of the 


reducing agent BMS 90-120 before the first MTSEA application to remove pre- 
existing disulphide bonds, as described previously’. 

Solutions. The standard extracellular Ringer’s solution contained 130 mM NaCl, 
4.5mM KCl, 20mM CaCl,, 10mM tetraethylammonium chloride (TEA-Cl), 
10mM p-glucose, and 5mM Na-HEPES (pH7.4). The DVF Ringer’s solution 
contained 150mM NaCl, 10mM HEDTA, 1mM EDTA, 10mM TEA-ClI and 
5mM HEPES (pH7.4). The 110-mM Ca’* solution contained 110 mM CaCh, 
10mM b-Glucose and 5mM HEPES (pH7.4). The standard internal (pipette) 
solution contained 135mM Cs-aspartate, 8mM MgCl, 8mM BAPTA and 
10 mM Cs HEPES (pH 7.2). In experiments examining CDI, BAPTA was replaced 
with EGTA to accentuate CDI. In these experiments, the internal solution con- 
tained 125 mM Cs-aspartate, 10 mM EGTA, 3 mM MgCl, 8 mM NaCl and 10 Cs 
HEPES (pH 7.2). Stock solutions of MTS reagents (Toronto Research Chemicals) 
were prepared as previously described®. 

Data analysis. Reversal potentials were measured from the average of several leak- 
subtracted sweeps (4-6 sweeps) in each cell. Measurements were taken from 6-15 
cells per mutant per condition. In cases in which the I-V curve asymptotically 
approached the x axis at very positive membrane potentials with no clear reversal 
(for example, in wild-type ORAI1-expressing cells), the reversal potential was 
assigned as +80 mV. The MTSEA reaction rate constant was estimated from single 
exponential fits to the current decline after MTSEA application. The apparent 
second-order modification rate constant k,,, was calculated from the relationship: 


1 


kon = ATC) 
t{MTS| 
where [MTS] is the concentration of the MTS reagent. 
Relative permeabilities were calculated from changes in the reversal potential 
using the Goldman-Hodgkin-Katz voltage equation: 


Py [Na], A Evey F/RT 


Pya [X] 


o 


where R, T and F have their usual meanings, Px and Py, are the permeabilities of 
the test ion and Na’, respectively, [X] and [Na] are the ionic concentrations, and 
AE rey is the shift in reversal potential when the test cation is exchanged for Na*. To 
estimate the minimal width of ORAI1 channels, the relative channel permeabilities 
for a series of organic monovalent cations of increasing size were examined as 
described before’. The cations used were ammonium (3.2 A), methylammonium 
(3.78 A), dimethylammonium (4.6 A) and trimethylammonium (5.34 A). These 
experiments were carried out in buffered Ca”*-free solutions to avoid the potent 
blocking effects of Ca** ions on monovalent Icrac. The data were fitted to the 


hydrodynamic relationship”: 
P. ln 
XxX =K 1 dion 
P Na pore 


where d;on is the diameter of the tested ion and dyore is the apparent width of the 
pore. 

All data were corrected for leak currents collected in 20 mM Ca?* + 50-150 uM 
La®*. All curve fitting was done by least-squares methods using built-in functions 
in Igor Pro 5.0. 
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Structure of the human M2 muscarinic acetylcholine 
receptor bound to an antagonist 
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The parasympathetic branch of the autonomic nervous system 
regulates the activity of multiple organ systems. Muscarinic receptors 
are G-protein-coupled receptors that mediate the response to 
acetylcholine released from parasympathetic nerves'*. Their role 
in the unconscious regulation of organ and central nervous system 
function makes them potential therapeutic targets for a broad 
spectrum of diseases. The M2 muscarinic acetylcholine receptor 
(M2 receptor) is essential for the physiological control of cardio- 
vascular function through activation of G-protein-coupled inwardly 
rectifying potassium channels, and is of particular interest because of 
its extensive pharmacological characterization with both orthosteric 
and allosteric ligands. Here we report the structure of the antagonist- 
bound human M72 receptor, the first human acetylcholine receptor to 
be characterized structurally, to our knowledge. The antagonist 
3-quinuclidinyl-benzilate binds in the middle of a long aqueous 
channel extending approximately two-thirds through the mem- 
brane. The orthosteric binding pocket is formed by amino acids that 
are identical in all five muscarinic receptor subtypes, and shares 
structural homology with other functionally unrelated acetylcholine 
binding proteins from different species. A layer of tyrosine residues 
forms an aromatic cap restricting dissociation of the bound ligand. 
A binding site for allosteric ligands has been mapped to residues 
at the entrance to the binding pocket near this aromatic cap. The 
structure of the M2 receptor provides insights into the challenges of 
developing subtype-selective ligands for muscarinic receptors and 
their propensity for allosteric regulation. 

Muscarinic receptors constitute a family with five subtypes, M1-M5 
(ref. 1). M1, M3 and M5 subtypes couple with the G, family of G 
proteins, and M2 and M4 subtypes with the G,/G, family of G proteins. 
Previous work showing that the muscarinic action by a series of choline 
esters and other substances in various tissues could be differentiated 
from their nicotinic action’ led to muscarinic acetylcholine receptors 
being defined as a functional concept. Muscarinic receptors are now 
known to be G-protein-coupled receptors (GPCRs)? and the nicotinic 
receptor a ligand-gated ion channel. Muscarinic receptors were 
initially defined biochemically as proteins that specifically bound 
3-quinuclidinyl-benzilate (QNB) and N-methylscopolamine (NMS). 
They were among the first GPCRs to be purified from cerebral mem- 
branes’, and to be functionally reconstituted with purified G protein 
in lipid vesicles’. The M1 receptor* together with the B, adrenergic 
receptor® were the first neurotransmitter-activated GPCRs to be 
cloned, revealing the seven transmembrane (TM) segment topology 
initially observed for rhodopsin’, and subsequently found to be 
common to all members of the GPCR family. 

As a consequence of their roles in both the central and para- 
sympathetic nervous systems, muscarinic receptors are targets for 
the treatment ofa spectrum of disorders including Alzheimer’s disease, 


schizophrenia and Parkinson’s disease, and chronic obstructive 
pulmonary disease*. However, developing highly subtype-selective 
orthosteric drugs for muscarinic receptors has been challenging and 
thus far largely unsuccessful. Recent drug discovery efforts have 
therefore shifted to the development of small molecule allosteric 
modulators. Muscarinic receptors have long been a model system 
for studying allosteric regulation of GPCR signalling because of their 
exceptional propensity to bind allosteric ligands’. To understand 
better the structural basis for challenges in developing orthosteric 
drugs and the susceptibility for allosteric regulation, we obtained a 
crystal structure of the M2 receptor. 

In our initial efforts to obtain the structure of the M2 receptor 
we expressed and purified M2 receptor lacking most of the third 
intracellular loop (IL3) and the native glycosylation sites. The central 
part of IL3 of the M2 receptor can be removed without impairing its 
ability to bind to agonists or activate G proteins'®, and IL3 was shown 
to have a flexible structure’. Using this modified M2 receptor bound 
to the high-affinity inverse agonist R-(—)-3-QNB, we performed 
crystallization by hanging-drop vapour diffusion and obtained crystals 
that diffracted to around 9 A, but were not able to improve the quality 
of these crystals. We subsequently replaced IL3 of the M2 receptor 
with T4 lysozyme (T4L) as initially described for the B, adrenergic 
receptor’” (Supplementary Fig. 1a). This method has been used to obtain 
crystal structures of four other GPCRs: the adenosine Az, receptor’’, the 
CXCR4 receptor", the dopamine receptor D3 (ref. 15) and most recently 
the histamine H, receptor’®. The binding properties of M2-T4L with 
muscarinic ligands were essentially the same as for the wild-type M2 
receptor (Supplementary Fig. 1b, c), indicating that the overall TM 
architecture of M2-T4L was minimally affected by introduction of 
TAL. The M2-T4L receptor was subsequently crystallized in lipidic cubic 
phase. A 3.0 A structure was solved by molecular replacement from a 
data set obtained by merging diffraction data from 23 crystals. 

As is typical for proteins crystallized by the lipidic cubic phase 
method, the lattice for the M2 receptor shows alternating aqueous 
and lipidic layers with M2 receptor molecules embedded in the latter 
while T4L is confined to aqueous regions (Supplementary Fig. 2). 
Within the membrane plane, receptor molecules are packed closely 
against one another, alternating orientations within the bilayer. There 
are abundant hydrophobic contacts between receptor molecules 
within the membrane, whereas polar interactions primarily involve 
contacts between T4L molecules as well as receptor-T4L interactions. 

The overall structure of the M2 receptor (Fig. 1a) is similar to that of 
rhodopsin and other recently crystallized inactive GPCR structures 
(compared in Supplementary Fig. 3). The cytoplasmic surface of the 
M2 receptor is in an inactive conformation, but as with most other 
GPCR structures, there is no interaction involving Arg 121°°° (super- 
scripts indicate Ballesteros-Weinstein numbers) in the conserved 
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Figure 1 | The M2 receptor with bound QNB. a-e, The M2 receptor is shown 
as a blue ribbon and QNB as orange spheres. a, M2 receptor in profile. 

b, Cytoplasmic surface showing conserved DRY residues in TM3. 

c, Extracellular view into QNB binding pocket. d, Extracellular view with 
solvent-accessible-surface rendering shows a funnel-shaped vestibule and a 


E/DRY sequence in TM3 and Glu 382°°° in TM6 (Fig. 1b). Instead, the 
Arg121°*° side chain forms a salt bridge only with Asp 120°”. In 
rhodopsin, the homologous residues form part of a charge-charge 
interaction that stabilizes the cytoplasmic ends of TM3 and TM6 in an 
inactive state'’. The second intracellular loop shows a helical conforma- 
tion similar to that first seen for the turkey B, adrenergic receptor”. 
GPCR crystal structures show the greatest differences in the extra- 
cellular surface (Supplementary Fig. 3). The M2 receptor has a relatively 
simple and open extracellular surface (Fig. 1c, d) with the longer extra- 
cellular loop (ECL)2 stabilized by a conserved disulphide with Cys 96°” 
at the N terminus of TM3 and Cys176 in the middle of ECL2. In 
addition, the second disulphide bond was detected between C413 and 
C416 in the ECL3. The extracellular surface of the M2 receptor most 
resembles that of the dopamine D3 receptor (Supplementary Fig. 3). 
Crystal structures of GPCRs reveal a network of hydrogen bonding 
interactions that extend from the binding pocket to the cytoplasmic 
surface. However, a distinctive feature of the M2 receptor is that this 
network is part of a long, continuous aqueous channel extending from 
the extracellular surface to a depth of approximately 33 A when 
measured from ECL2 (Fig. le). This channel contains the ligand bind- 
ing pocket, but extends beyond the ligand and is separated from the 
cytoplasmic surface by a hydrophobic layer formed by three amino 
acids: Leu65~*° in TM2, Leu 114" in TM4 and Ile 392°° in TM6. 
Each of these is absolutely conserved among all five muscarinic sub- 
types. The dimensions of the channel below the QNB binding site are 
large enough to accommodate a long, extended orthosteric ligand. 
Supplementary Fig. 4 compares the aqueous channels of other GPCRs. 
The ligand QNB binds within a deeply buried pocket defined by the 
side chains of TM3, 4, 5, 6 and 7 (Fig. 2a—c and Supplementary Fig. 5 
and Supplementary Table 3). An aromatic cage encloses the amine and 
forms a lid over the ligand, separating the orthosteric site from the 
extracellular vestibule. Asp 103°°” and Asn 404°°” serve to orient the 
ligand in the largely hydrophobic binding cavity, with Asn 404°°? 
forming paired hydrogen bonds with the hydroxyl and carbonyl groups 
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nearly buried QNB binding pocket. e, Aqueous channel (green) extending from 
the extracellular surface into the transmembrane core is interrupted by a layer 
of three hydrophobic residues (blue spheres). Well-ordered water molecules are 
shown as red dots. 


in QNB while Asp 103° engages in a charge—charge interaction with 
the amine moiety of the ligand (Fig. 2). The TM amino acids that form 
the QNB binding pocket are identical in all five muscarinic receptor 
subtypes (Supplementary Table 1), consistent with results of QNB 
binding experiments on M1-M4 receptors, and with site-directed 
mutagenesis experiments on M1 (ref. 19), M2 (ref. 20) and M3 
(ref. 21) receptors. Only Phe 181, which extends downward from 
ECL2 and interacts with one of the two phenyl rings on QNB 
(Fig. 2), differs from all other muscarinic receptor subtypes, which have 
leucine in the homologous position. The importance of Asp**” for both 
agonist and antagonist binding has been demonstrated in mutagenesis 
and covalent-labelling experiments and modelling studies’? ”*. In con- 
trast, mutation of Asn 404°” to Ala on M1 (ref. 23) and M3 (ref. 24) 
receptors was shown to greatly affect binding of QNB but have little 
effect on binding of or activation by acetylcholine. It is possible that 
Asn 404°°? is hydrogen bonded with the ester group of QNB but not 
that of acetylcholine. 

The M2 and other muscarinic receptors represent one of four families 
of acetylcholine binding proteins to be structurally characterized thus 
far. Figure 3a shows the orthosteric binding site of the M2 receptor with 
acetylcholine docked with the gauche form of the O-C2-C1-N dihedral 
angle, which places the choline group in the aromatic cage interacting 
with Asp 103°*”, while the carbonyl oxygen is tentatively bound to 
Asn 404°*? (Fig, 3a). The natural agonist acetylcholine is much smaller 
than the bulky antagonist QNB. As described in the agonist-bound 
structure of the B, adrenergic receptor, the contraction of the ligand 
binding pocket is expected as a result of an inward shift of TM5 (ref. 25). 
This result is consistent with the previous mutation studies showing that 
Thr 187°? and Thr 190°? in TM5 (Fig. 2) alter binding of most 
agonists but not of antagonists””. Bulky compounds capable of blocking 
activation-related contraction of the pocket would be very efficient in 
locking the M2 receptor in an inactive conformation, as is exemplified 
here by the antagonist QNB. It has been proposed that the con- 
formational change of the M2 receptor upon activation might be 
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Figure 2 | Binding interactions between the M2 receptor and QNB. 

a, b, Two views of the QNB binding pocket. Amino acids within 4 A of the 
ligand are shown as light blue sticks, with QNB in orange. Nitrogen and oxygen 
atoms are coloured dark blue and red, respectively. Polar interactions are 
indicated by dashed lines. A 2F, — F. map is shown in wire at 1.50 contour. c, A 
schematic representation of QNB binding interactions is shown. Mutations of 


accompanied by a conformational change of acetylcholine from the 
gauche to the trans form of the O-C2-C1-N dihedral angle*®. It 
remains to be determined in which pose acetylcholine binds to the 
M2 receptor or to the M2-receptor—G-protein complex, and whether 
acetylcholine hydrogen bonds with Asn 404° or other residues. 

Ina striking example of convergent evolution, the orthosteric site of 


the M2 receptor exhibits many features noted previously as common 
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© ChoxX protein (bacterium) 
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Figure 3 | Convergent evolution of acetylcholine binding sites. 

a, Acetylcholine is modelled into the crystal structure of the M2 receptor. 
b, Acetylcholine binding pocket in the crystal structure of the acetylcholine 
binding protein from the snail Aplysia californica (PDB accession 2XZ5). 


1 

1 

oe 

7 1 
0. 


LETTER 


T190°-42 


a 
s ? 
. ¢ 
* ¢ 


w 


yas07-* | / 
Ul 3.40 
--HO- |V111 
fe) i o--- vi11949| 


ay 


[i Mutation reduces agonist and antagonist affinity by >10 fold 
[EJ Mutation reduces antagonist affinity by >10 fold 
= Mutation reduces agonist affinity by >10 fold 


= =» = Hydrophobic contacts 
==== Polar interations 


amino acids in the red boxes have been shown to reduce both antagonist and 
agonist binding by more than tenfold. Mutations of the amino acid in the 
purple boxes reduce antagonist binding affinity by more than tenfold. 
Mutations of amino acids in the blue boxes reduce agonist binding by more 
than tenfold. Blue dotted lines indicate potential hydrophobic interactions and 
red lines indicate potential polar interactions. 


structural elements in unrelated acetylcholine binding proteins”’. Like 
the M2 receptor, a nicotinic acetylcholine receptor homologue bound 
to acetylcholine (Fig. 3b) shows an aromatic cage comprised of three 
tyrosines and a tryptophan, although it notably lacks a counterion to 
the choline group”, whereas in the M2 receptor this role is filled by 
Asp 103°°*. A bacterial acetylcholine binding protein, ChoX, from 
Sinorhizobium meliloti (Fig. 3c) also possesses an aromatic cage, and 


b Acetylcholine binding protein (snail) 


Acetylcholine 


Te 


ds Acetylcholine esterase (electric ray) 


a0 


Thioacetylcholine 


c, Acetylcholine binding pocket in the acetylcholine binding protein ChoX 
from the Gram negative bacterium Sinorhizobium meliloti (PDB accession 
2RIN). d, Binding site for thio-acetylcholine in the enzyme acetylcholine 
esterase from the electric ray Torpedo californica (PDB accession 2C4H). 


00 MONTH 2012 | VOL 000 | NATURE | 3 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 
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Figure 4 | Allosteric binding in the M2 receptor. a, Differences between the 
M2 and M4 receptors are shown as green residues mapped onto the inner 
surface of the M2 receptor (blue), with QNB in orange spheres. The sequence 
conservation within the orthosteric site is apparent, while residues outside show 
more variability. b-d, Mutations that alter allosteric binding are shown with 
yellow carbons, and amino acids involved in QNB binding are shown with blue 


like the M2 receptor has an aspartate in close proximity to the amine 
engaging in a charge-charge interaction”. Also like the M2 receptor, 
ChoX has an asparagine hydrogen bonding to the ligand carbonyl. Like 
these proteins, the enzyme acetylcholine esterase (Fig. 3d) uses an 
aromatic cage and a carboxylate to bind the choline group, while the 
(thio)acetyl group interacts with a phenylalanine, probably through 
m-m interactions®®. Taken together, these structures suggest that an 
aromatic cage and buried carboxylate are likely to be critical elements 
for acetylcholine recognition and binding in general. 

There is a growing interest in the development of allosteric ligands for 
GPCR targets. This is motivated by the ability to develop more subtype- 
selective drugs targeted at less conserved regions of the receptor. 
Moreover, allosteric ligands modulate the effects of natural hormones 
and neurotransmitters, and may therefore regulate receptor activity in 
a more physiological manner. As noted above, the orthosteric binding 
pocket is highly conserved among all muscarinic receptor subtypes. 
Allosteric regulation of GPCRs was first observed for the M2 receptor 
and this receptor has been one of the most extensively characterized 
allosteric model systems’. Figure 4a shows the inner surface of the M2 
receptor, highlighting residues that are not conserved with its closest 
relative, the M4 receptor. It can be seen that the orthosteric binding 
pocket and transmembrane core are highly conserved. The greatest 
diversity is observed in the extracellular loops and the extracellular end 
of TM segments that form the entrance to the orthosteric binding 
pocket. These amino acids represent structural diversity that could 
be exploited for the development of more subtype-selective ligands’. 
Of interest, site-directed mutagenesis and chimaeric receptor studies 
have implicated several of these amino acids in the binding of several 
well-characterized allosteric modulators’. As shown in Fig. 4b-d, these 
residues are located in ECL2 and the amino-terminus of TM7 at the 
entrance to the binding pocket. Trp 422”*», a residue implicated in the 
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carbons as sticks or spheres. b, c, Different views of possible allosteric binding 
sites in the M2 receptor. The surface view in c shows the positions of possible 
allosteric binding sites (yellow) lining the path to the QNB binding pocket. 

d, Trp 422 (yellow spheres), implicated in binding of allosteric ligands, forms an 
edge-to-face aromatic interaction with Tyr 403, part of the aromatic cage (blue 
spheres) of the orthosteric site. 


binding of several allosteric modulators, appears to form an edge-to- 
face n—T interaction with Tyr 403°", part of the aromatic cage sur- 
rounding the charged amine of the orthosteric ligand (Fig. 4d). Binding 
of allosteric ligands to this site would be expected to influence the 
association and disassociation rates of orthosteric ligands. 

The structure of the M2 receptor provides insights into both orthosteric 
and allosteric regulation of muscarinic receptors. The development of 
more selective drugs for muscarinic receptors will probably require 
exploitation of the more diverse allosteric surface, either as exclusively 
allosteric ligands or as ligands that occupy both orthosteric and 
allosteric sites. 


METHODS SUMMARY 


Untagged human M2 muscarinic acetylcholine receptor was expressed in Sf9 cells 
with the IL3 replaced with T4 lysozyme, then extracted with digitonin and sodium 
cholate and purified by ligand affinity chromatography, then exchanged into decyl 
maltoside buffer. Purified receptor was crystallized by the lipidic cubic phase 
technique following addition of a stabilizing neopentyl glycol detergent. Data 
collection was performed at Advanced Photon Source beamlines 23ID-B and 
23ID-D, and the structure solved by molecular replacement. Refinement statistics 
are given in Supplementary Table 2. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Construction of M2-TAL expression vectors for Sf9 cells. The coding sequence 
of the human M2-T4L receptor fusion protein was designed to have N-linked 
glycosylation sites (Asn 2, Asn 3, Asn6 and Asn9) mutated to aspartic acid and 
cysteine-less T4L (C54T, C97A) residues 2-161 inserted into the IL3, replacing 
M2 residues 218-376. This construct was synthesized (TAKARA Bio), and cloned 
into the pFastbacl Sf9 expression vector (Invitrogen) as illustrated in Supplementary 
Fig. la. A TAA stop codon was placed after the R466 codon, terminating translation. 
The synthesized M2-T4L receptor described above was confirmed by sequencing. 
Expression and membrane preparation. Recombinant baculovirus was made 
from pFastbacl-M2-T4L using the Bac-to-Bac system (Invitrogen)*". The M2- 
TAL protein was expressed in baculovirus-infected Sf9 insect cells as described 
previously”. Sf9 insect cells were prepared at a density of 1.0 X 10° cells ml ' and 
suspended in 5 litres of the IPL-41/SF900 II complex media or ESF921 insect 
media. Media containing Sf9 insect cells were transferred into the CELLBAG 22 
L/O (GE Healthcare) and cultured for 4 days with the following culture conditions: 
20 r.p.m., 8.5° rocking angle, 30% Oo, 0.251 min | ofair flow rate and 27 °C. After 
4 days, 200 to 300 ml of the M2-T4L baculovirus stock (approximate multiplicity 
of infection (m.o.i.), 2) and 700 to 800 ml of IPL-41/SF900 II complex media were 
transferred into the CELLBAG (final culture volume, 6 litres) and infected for 2 
days under the following infection conditions: 22 r.p.m., 8.5° rocking angle, 50% 
O,, air flow rate, 0.25lmin ' and 27°C. Two days later, a fraction of the cells was 
harvested for the binding assay and the remaining cells were centrifuged at 6,000g¢ 
for 10 min and harvested. The cell pellet was washed with 250 ml of PBS without 
calcium chloride and magnesium chloride (PBS(—)) and resuspended with 100 ml 
of PBS(—) containing a protease inhibitor cocktail tablet (Roche). Final concen- 
tration of protease inhibitors was 2.5 1g ml’ pepstatin, 2 4g ml ’ PMSF, 20 pg 
ml ' leupeptin and 0.5 mM benzamidine. Cells were quick frozen in liquid nitro- 
gen and stored at —80 °C. 

The membrane was prepared from the M2-T4L-expressing Sf9 insect cells as 

described previously*’. For the preparation of membranes from insect cells, Sf9 
insect cells were centrifuged at 1,500g for 10 min at 4 °C. The pellet was washed 
with PBS(—), then resuspended in 100 ml of hypotonic buffer containing 10 mM 
HEPES at pH 7.5, 20mM KCl, 10mM MgCl and protease inhibitor cocktail, 
followed by Dounce homogenization to resuspend the membranes. Insect cell 
membranes were centrifuged at 100,000g for 30 min and the pellets were resus- 
pended in 10 mM HEPES at pH 7.5, 10 mM MgCl, 20 mM KCl, 40% glycerol, and 
snap-frozen in liquid nitrogen and then stored at —80°C until use. Membrane 
proteins were quantified using the bicinchoninic acid (BCA) method (Pierce) 
using a BSA standard. 
Purification of M2-T4L-QNB. M2-T4L was expressed in Sf9 cells, solubilized 
with digitonin/Na-cholate solution, and purified by using an affinity column with 
aminobenztropine (ABT) as a ligand*’, as described below. The whole procedure 
was carried out at 4 °C. Sf9 membrane preparations with 2.1 kg of wet weight and 
approximately 1.5 jumol of [*H]QNB binding sites were solubilized with 1% digi- 
tonin, 0.35% Na-cholate, 10mM K-phosphate buffer (pH 7.0) (KPB), 50 mM 
NaCl, 1mM EDTA, a cocktail of protease inhibitors (41). The supernatant was 
applied to two ABT columns run in parallel (500 ml each), followed by washing 
with 0.1% digitonin, 0.1% Na-cholate, 20 mM KPB, 150mM NaC] (21 X 2) ata 
rate of approximately 90 mlh'. M2-T4L was eluted from the ABT columns with 
0.5 mM atropine, 0.1% digitonin, 0.1% Na-cholate, 20 mM KPB, 150 mM NaCl in 
21 elution volume for each column, and was bound to a column of hydroxyapatite 
(30 ml), which was washed at a rate of 30-50 mlh ! with a series of solutions as 
follows: (1) 0.1% digitonin, 0.1% Na-cholate, 20 mM KPB (100 ml); (2) 5 uM QNB, 
0.1% digitonin, 0.1% Na-cholate, 20mM KPB (600 ml); (3) 0.35% Na-cholate, 
20mM KPB (600 ml); (4) 0.2% decylmaltoside, 20 mM KPB (500 ml); (5) 0.2% 
decylmaltoside, 150 mM KPB (100 ml); (6) 0.2% decylmaltoside, 500 mM KPB 
(60 ml). M2-T4L-QNB was finally eluted with 0.2% decylmaltoside, 1M KPB 
(50 ml). The eluate was concentrated to approximately 1ml (approximately 
30 mg protein per ml) with Amicon Ultra (MILLIPORE), followed by dialysis 
against 0.2% decylmaltoside, 20mM Tris-HCl buffer (pH 7.5) and storage in 
—80 °C. The yield was estimated to be approximately 50% on the assumption that 
the recovered protein is pure M2-T4L. Protein concentration was determined 
using BCA Protein Assay (PIERCE). Because we purified M2-T4L as a complex 
with QNB we could not estimate the [7H]QNB binding activity because the dis- 
sociation rate of QNB is too slow. However, in preliminary experiments using 
[H]QNB or dissociable atropine as eluants, we confirmed that the receptor is 
purified to near homogeneity. The purity of M2-T4L was confirmed by SDS- 
PAGE and gel permeation chromatography (Supplementary Fig. 6). All the 
QNB used in purification and crystallization was the high-affinity enantiomer, 
R-(—)-3-QNB. 


Measurement of ligand binding activity. Ligand binding activity of wild-type 
M2 and M2-T4L receptors was determined as described previously™. Briefly, the 
receptors solubilized from Sf9 membranes were incubated with 0.1-4nM 
(H]QNB with or without 1 uM atropine, or with 2nM PH]QNB with various 
concentrations of carbamylcholine or atropine in 0.1% digitonin, 20 mM KPB for 
60 min at 30 °C (total volume 0.2 ml). The amount of [>H]QNB bound to receptors 
was assayed by using a small column of Sephadex G50 fine (2 ml). The density of 
[PH] QNB binding sites in the particulate fraction of M2-T4L was 17 pmol per mg 
of protein on average and ranged from 5.3-35 pmol per mg of total protein. 
Crystallization. QNB-bound M2-T4L was concentrated to 20mg ml ' in decyl 
maltoside buffer in a volume of approximately 100 pl. A 10% stock solution of 
lauryl maltose neopentyl glycol detergent (MNG, Anatrace) with 100 mM NaCl 
and 20 mM HEPES pH 7.5 was then added to the protein to a final concentration 
of 1% (w/v) of MNG detergent. The sample was incubated for 1h on ice, then 
diluted to 1 ml in 0.1% MNG buffer and reconcentrated to 50mg ml * before 
reconstitution. The final volume of protein sample at this concentration was 
typically 20-30 kl. Protein was reconstituted in cubic phase by mixing with a 
1.5-fold weight excess of a 10:1 monoolein:cholesterol mix by the twin-syringe 
method”. Briefly, the protein and lipid were mixed by passage through coupled 
syringes 100 times either by hand or using a Gryphon LCP robot (Art Robbins 
Instruments). The reconstituted protein was dispensed using a modified ratchet 
device (Hamilton) or using the Gryphon LCP robot in 40 nl drops to either 24-well 
or 96-well glass sandwich plates and overlaid with 0.8 ul precipitant solution. A 
single crystallization lead was initially identified using an in-house screen and then 
optimized. Crystals for data collection were grown in 25-35% PEG 300, 100 mM 
ammonium phosphate, 2% 2-methyl-2,4-pentanediol, 100 mM HEPES pH 7.0- 
7.8. Crystals reached full size and were harvested after 3-4 days at 20 °C. Typical 
crystals are shown in Supplementary Fig. 7. 

Data collection and processing. Diffraction data were measured at the Advanced 
Photon Source beamlines 23 ID-B and 23 ID-D. Several hundred crystals were 
screened, and a final data set was compiled using diffraction wedges of typically 5 
degrees from the 23 most strongly diffracting crystals. Data reduction was per- 
formed using HKL2000°°. Diffraction quality was very heterogeneous, with some 
crystals diffracting to 2.3 A whereas others failed to diffract past 3.5 A. Among the 
best crystals, most diffracted to 3.0-2.5 A. Severe radiation damage and aniso- 
tropic diffraction resulted in low completeness in higher resolution shells. We 
report this structure to an overall resolution of 3.0 A. Despite the low completeness 
in high resolution bins, inclusion of these reflections significantly improved map 
quality. Highest shell <I>/<cI> is relatively low, in large part due to anisotropy 
of the diffraction. The final resolution cut-off was chosen on the basis of complete- 
ness and <I>/<oI> in the spherical highest shell, but analysis of average F/oF 
values along reciprocal space axes suggests resolution limits (based on average 
F/oF> 3) of 3.5, 2.9 and2.7A along a*, b* and c*, respectively. The real space 
c-axis is normal to the plane of the lipid membrane in the crystal. 

Structure solution and refinement. The structure was solved by molecular 
replacement using Phaser*’”** with the structure of the inactive B, adrenergic 
receptor and T4L used as search models (PDB accession 2RH1). The initial 
molecular replacement model was further fitted by rigid body refinement followed 
by simulated annealing and restrained refinement in Phenix”. Iterative manual 
rebuilding and refinement steps were performed with Coot and phenix.refine, 
respectively. Figures were prepared with PyMOL, and Ramachandran statistics 
were calculated with MolProbity. 
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Maternal and paternal genomes contribute equally to 
the transcriptome of early plant embryos 


Michael D. Nodine!? & David P. Bartel!? 


In animals, maternal gene products deposited into eggs regulate 
embryonic development before activation of the zygotic genome’. 
In plants, an analogous period of prolonged maternal control over 
embryogenesis is thought to occur based on some gene-expression 
studies**. However, other gene-expression studies and genetic 
analyses show that some transcripts must derive from the early 
zygotic genome’ “, implying that the prevailing model does not 
fully explain the nature of zygotic genome activation in plants. To 
determine the maternal, paternal and zygotic contributions to the 
early embryonic transcriptome, we sequenced the transcripts of 
hybrid embryos from crosses between two polymorphic inbred lines 
of Arabidopsis thaliana and used single-nucleotide polymorphisms 
diagnostic of each parental line to quantify parental contributions. 
Although some transcripts seemed to be either inherited from 
primarily one parent or transcribed from imprinted loci, the vast 
majority of transcripts were produced in near-equal amounts from 
both maternal and paternal alleles, even during the initial stages of 
embryogenesis. Results of reporter experiments and analyses of 
transcripts from genes that are not expressed in sperm and egg 
indicate early and widespread zygotic transcription. Thus, in con- 
trast to early animal embryogenesis, early plant embryogenesis is 
mostly under zygotic control. 

The prevailing model for the maternal-to-zygotic transition in plants 
proposes that most early embryonic messenger RNAs are maternally 
derived transcripts, resulting either from maternal inheritance or from 
higher transcriptional activity of maternally derived genes until the 
globular stages (in which the embryo proper has between ~32 to 
>100 cells)**'*. Because fundamental patterning events, including api- 
cal-basal and radial axis formation'*”’, occur during the preglobular 
stages, this model implies that key cell-specification decisions are mostly 
under maternal control. However, this model is difficult to reconcile 
with other studies that report equivalent maternal and paternal expres- 
sion of interrogated genes in preglobular stages*”"! and zygotic-recessive 
behaviour of mutants with preglobular developmental phenotypes’**. 

To determine the origins of embryonic transcripts globally, we 
crossed polymorphic Col-0 and Cvi-0 Arabidopsis thaliana accessions 
and performed RNA-seq on poly(A)* RNA isolated from hybrid 
embryos with either one-to-two, eight, or ~32 cells in the embryo 
proper (hereafter referred to as 1-cell/2-cell, 8-cell and ~32-cell 
embryos). To control for inherent expression differences between 
Col-0 and Cvi-0 loci, the same procedure was performed using 
embryos derived from reciprocal crosses. lumina sequencing of the 
six samples yielded 73,955,956 reads that both perfectly and uniquely 
matched the transcribed regions of 23,874 genes (Supplementary 
Table 1). Overall, transcript levels from the same stage but different 
reciprocal crosses were highly correlated (r= 0.96; Supplementary 
Fig. 1). This reproducibility and sequencing depth indicated that our 
results would be informative for inferring the maternal, paternal and 
zygotic contributions to the early embryonic transcriptome. 

The prevailing model for the maternal-to-zygotic transition predicted 
that at the early embryonic stages transcripts would derive primarily 


from maternal alleles, and then at the globular stage they would derive 
more evenly from both alleles because at this stage the zygotic genome 
would be active. In contrast to this expectation, we found equal 
amounts of paternally and maternally derived reads at all three stages, 
including the 1-cell/2-cell and 8-cell embryos (Fig. la). When 
examining transcripts with at least five reads overlapping single- 
nucleotide polymorphisms (SNPs) in each cross, most were expressed 
equally from the maternal and paternal alleles, even at the earliest stage 
(Fig. 1b and Supplementary Data 1; note that in the Col-0 x Cvi-0 
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Figure 1 | Maternal and paternal genomes contribute equally to the early 
embryonic transcriptome. a, Proportion of RNA-seq reads overlapping 
maternal and paternal SNPs within transcribed regions of annotated genes. 
Drawings illustrate stages of isolated embryos, coloured to represent the 
diversity of cell lineages in each stage. Pie charts show proportions of reads 
overlapping maternal and paternal SNPs. b, Distributions of maternal-to- 
paternal ratios (logs) in 1-cell/2-cell (top), 8-cell (middle) and ~32-cell 
(bottom) embryos. Maternal-to-paternal ratios were calculated for transcripts 
that had =5 SNP-overlapping reads from the Col-0 X Cvi-0 cross (green; 

n > 7,388 for each stage) or the Cvi-0  Col-0 cross (purple; n > 8,071 for each 
stage), and transcripts that had =5 SNP-overlapping reads from both 
reciprocal crosses (black; n > 6,892 for each stage). Binomial modelling 
determined the distributions expected for transcripts examined in both 
reciprocal crosses if there had been no parent-of-origin effects and no noise 
apart from stochastic counting variability (grey dashed lines). 


1Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, Massachusetts 02142, USA. *Howard Hughes Medical Institute and Department of Biology, Massachusetts Institute of 


Technology, Cambridge, Massachusetts 02139, USA. 


00 MONTH 2012 | VOL 000 | NATURE|1 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


cross, the maternal parent is Col-0, as the convention is to write the 
maternal parent first). In each cross, small but discernable subpopula- 
tions of transcripts were derived predominantly from either maternal 
or paternal alleles. However, in the respective reciprocal cross those 
same transcripts tended to derive from the opposite parent, which 
showed that rather than arising from parent-of-origin effects, these 
subpopulations arose mostly from genotypic effects, that is, from pref- 
erential expression from either the Col-0 or Cvi-0 alleles. Thus, when 
considering results of both crosses together, no overall maternal (or 
paternal) bias was observed, and at each stage the distribution of 
maternal-to-paternal ratios resembled that predicted for equal contri- 
bution from both alleles, with most of the variability explained by 
stochastic counting statistics, as modelled by the binomial distribution 
(Fig. 1b). The remainder of the variability was attributed to both addi- 
tional noise (both experimental and biological) and a small subset of 
transcripts with parent-of-origin effects, for which the number that 
were maternally enriched approximately equalled the number that 
were paternally enriched. 

Because most transcripts were derived from the maternal and paternal 
genomes in near-equal amounts, we hypothesized that either equal 
amounts of each transcript were inherited upon fertilization or the 
zygotic genome was activated much earlier than previously proposed. 
Supporting the latter possibility, the fold-change distributions centred 
at 0.0 (log,), showing no hint of the maternal bias that would be 
expected if the egg, with its larger cytoplasm, contributed more RNA 
than the sperm (Fig. 1b). In addition, 543 transcripts had at least four- 
fold higher reads per million (genome- and cDNA-matching reads) 
values at the 8-cell stage than at the 1 -cell/2-cell stages, which suggested 
active transcription of the corresponding genes between the 1-cell/2- 
cell and 8-cell stages (as the alternative model of differential mRNA 
stability would require a large decrease in total mRNA between these 
two stages; Supplementary Data 1). Furthermore, 1,138 transcripts that 
were previously called undetectable in both egg and sperm microarray 
data sets'*"° were among the top 50% most abundant transcripts in our 
1-cell/2-cell data sets (Supplementary Data 1). These results, taken 
together with the observation that transcripts for RNA polymerase II 
subunits were among the most abundant in the early embryo (Sup- 
plementary Table 2), suggested that during the initial stages of embryo- 
genesis many transcripts are transcribed from both maternal and 
paternal alleles, and that this transcription, combined with turnover 
of inherited transcripts, quickly overwrites most parent-of-origin 
biases present when the egg and sperm first fuse. 

To test directly whether maternally and paternally inherited genes are 
transcribed in very early embryos, we used the LhG4/pOp transactiva- 
tion system””. In this system, one parent contained a transgene encoding 
the LhGé4 transcription factor under the control of either the RPS5A or 
the UBI3 promoter (chosen because they generate products that can be 
found in the early embryo”*'), whereas the other parent harboured a 
nuclear-localized green fluorescent protein (GFP) reporter transgene 
under the control of the artificial pOp promoter to which LhG4 binds 
and transcriptionally activates (pOp::GFP). Because one gamete con- 
tributed the reporter gene and the other gamete contributed its activator, 
any reporter expression dependent on the activator could not occur until 
after the zygotic genome was transcriptionally active. No GFP signal was 
observed in embryos carrying the pOp::GFP reporter gene but no 
activator, which confirmed the absence of leaky expression in either 
the gametes or embryo (Fig. 2). GFP signal was detected in zygotes 
within 4-8 h after fertilization from crosses that brought the reporter 
together with its activator, regardless of whether the reporter was inherited 
through the egg or sperm (Fig. 2). The GFP signal in the early zygote was 
not as strong as that in the endosperm, which might indicate a slight 
delay in activation of the genome of the very early zygote or might result 
from more robust transcription in the endosperm. Nonetheless, when 
considered together our results demonstrate that both maternally and 
paternally inherited chromosomes are transcriptionally active at least in 
1-cell embryos and most likely before. 
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Figure 2 | Maternally and paternally inherited transgenes are transcribed in 
the initial stages of embryogenesis. Confocal scanning laser microscopy 
images show early embryos from the indicated crosses, which were between 
plants with a transgenic reporter (pOp::GFP) and either plants without an 
activator (wild type) or plants with a transgenic activator driven from either the 
RPSSA or the UBI3 promoter (pRPS5SA::LhG4 or pUBI3::LhG4, respectively). 
Embryonic stages and the approximate number of hours after fertilization 
(h.a.f.) are shown to the left. GFP signal is indicated by green nuclei; 
background autofluorescence is shown in red. Embryos are outlined in white 
for clarity. Asterisks indicate endosperm nuclei. Of the seeds with GFP signal in 
endosperm (n in parentheses), the percentage that also had GFP signal in 
embryos is indicated below each image. In endosperm, expression from 
maternal alleles is expected to be twofold higher than that from paternal alleles, 
but this difference is not observed in these images because the GFP signal in the 
endosperm was beyond its dynamic range. Scale bars represent 10 1m. 


Having established that most early embryonic transcripts derived 
equally from the maternal and paternal genomes, we turned our atten- 
tion to the few that might be preferentially inherited or preferentially 
expressed from the maternal or paternal alleles. For most early embry- 
onic transcripts the genotype of the allele had a larger affect on tran- 
script levels than did parent of origin (Figs 1b and 3a). Indeed, 
hundreds of transcripts were preferentially expressed from either the 
Col-0 or Cvi-0 allele irrespective of their parent of origin (Supplemen- 
tary Data 2), presumably as direct consequences of either DNA- 
sequence or epigenetic differences between the Col-0 and Cvi-0 alleles. 
This prevalence of genotypic/epigenotypic effects over parent-of- 
origin effects led to significant negative correlations between the 
maternal-to-paternal ratios at each stage (Fig. 3a, Pearson’s r < 
-0.43, P< 10°"). Nonetheless, at each stage a small subset of tran- 
scripts passed our cutoffs for classification as maternally or paternally 
enriched, that is, fourfold maternally or paternally enriched in each 
cross, and not exceeding the false discovery rate (FDR) threshold of 0.05 
(Pearson’s va tests, Benjamini and Hochberg FDR). With these cutoffs, 
77 and 45 transcripts were designated maternally and paternally 
enriched, respectively, in at least one stage (Fig. 3a and Supplemen- 
tary Data 3). For five transcripts, an independent assay involving 
diagnostic cleavage of an amplified polymorphic sequence was per- 
formed, and for all five, the parent-of-origin effects were confirmed 
(Supplementary Fig. 2). 
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Figure 3 | Dozens of transcripts are preferentially inherited or 
preferentially expressed. a, Maternal-to-paternal ratios for transcripts in 
1-cell/2-cell, 8-cell and ~32-cell embryos. Ratios for transcripts that had =8 
SNP-overlapping reads in both the Col-0 x Cvi-0 and Cvi-0 X Col-0 crosses are 
plotted. Points, as well as the numbers, that passed cutoffs for maternally or 
paternally enriched transcripts (see text) are indicated in red and blue, 
respectively. b, Heat maps of both maternal-to-paternal ratios and changes in 
transcript levels for transcripts passing cutoffs as maternally (top) or paternally 
(bottom) enriched in at least one of the three stages. Maternal-to-paternal ratios 
are colour-coded (key), unless the transcript had <8 SNP-overlapping reads in 
one cross, in which case the corresponding heat-map value is not coloured. 
Changes in the levels of a transcript (reads per million) relative to its mean are 
colour-coded (key). Numbers to the left indicate candidates for both preferential 
inheritance and preferential expression (1), preferential inheritance (2) or 
preferential expression (3). Numbers to the right highlight a subset of the class 2 
transcripts, which had relatively high levels in 1-cell/2-cell embryos compared to 
~32-cell embryos and thus potentially derived mainly from preferential 
inheritance (2A), ora subset of class 3 transcripts, which had relatively low levels 
in 1-cell/2-cell embryos compared to ~32-cell embryos and thus potentially 
derived mainly from parent-of-origin-specific transcription (3A). 


Although the maternal enrichment of some transcripts might be 
due to their transport from maternal sporophytic tissues, such a 
mechanism cannot explain paternal enrichment. Some transcripts 
had strong parent-of-origin effects in 1-cell/2-cell embryos, suggesting 
that they were preferentially inherited from one parent (Fig. 3b). 
Confirmation that these biases are indeed due to inheritance would 
substantially add to the single previously documented example of an 
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inherited transcript in plants. Other transcripts had stronger parent- 
of-origin effects at later stages, suggesting that they were preferentially 
expressed from either the maternally or the paternally inherited allele 
(Fig. 3b). This potential parent-of-origin-specific expression implied a 
form of imprinting in the Arabidopsis embryo. Genome-wide 
approaches similar to ours but looking much later after fertilization 
greatly expanded the list of genes with parent-of-origin-specific 
expression in the endosperm but did not identify such genes in 
embryos***. Thus, the imprinting-like phenomenon that we observed 
in early embryos is short lived, which suggests that it differs from the 
more persistent imprinting previously characterized in either endosperm 
or mammals”. For example, it might involve alternative chromatin states 
inherited from the egg and sperm, which after several cell divisions 
equilibrate between the two alleles. Such a mechanism would not 
necessarily require DNA methylation, although we note that in the 
only previous report of imprinting in plant embryos (at the MEE1 
locus of maize), methylation marks are lost from maternal alleles at 
the initial stages of embryogenesis, but then re-established to match the 
paternal alleles at later stages”*. 

Our results showing that during the initial stages of Arabidopsis 
embryogenesis both the maternal and paternal genomes are active 
and make essentially equivalent contributions to the embryonic tran- 
scriptome are in marked contrast to a recent report that ~88% of the 
2-cell/4-cell embryonic transcriptome is derived from the maternal 
genome’. That study, published while our manuscript was in prepara- 
tion, used SNPs in the transcriptomes of hybrid embryos derived from 
a cross between the Ler-1 (maternal parent) and Col-0 (paternal 
parent) accessions to estimate maternal and paternal genomic contri- 
butions. Because results from the reciprocal cross were not reported, 
the previous study could not distinguish parent-of-origin effects from 
genotypic effects. However, because an analysis unable to account for 
genotypic effects would be expected to misidentify transcripts as 
paternally derived as frequently as it misidentifies them as maternally 
derived, we considered other possibilities for the discrepancy between 
their results and ours. Our pilot studies had shown that early embryos 
must be extensively washed to prevent seed-coat RNA contamination, 
and indeed we found evidence that the embryo RNA samples of ref. 2 
contained large amounts of seed-coat mRNA (Supplementary Fig. 3). 
Because the seed coat is a maternal tissue, this contamination 
explained why they observed such a large bias in maternal RNAs. 

Genes zygotically required for the initial zygotic division have been 
identified in Arabidopsis”'*’. Moreover, paternal gene expression has 
been detected for 24 endogenous genes and a transgene in maize 
zygotes*”’, and de novo transcription has been demonstrated recently 
in tobacco zygotes”*. Rather than interpreting these findings as excep- 
tions to the model or as differences between species, our transcriptome- 
wide analyses and reporter data strongly support the proposal of an 
alternative model for the maternal-to-zygotic (or, to put it more pre- 
cisely, the maternal/paternal-to-zygotic) transition in plants. In this 
model, the zygotic genome is activated within the first few hours after 
fertilization, and equal transcription of both maternal and paternal 
alleles generates most of the early embryonic transcriptome at the 1-cell 
stage or before. Our model implies that although the maternal and 
paternal gene products are inherited in the zygote upon fertilization 
and contribute to its development”, the zygotic gene products are the 
ones that primarily control the initial cell-specification events of 
embryogenesis. 


METHODS SUMMARY 


Col-0 and Cvi-0 seed stocks were obtained from the Arabidopsis Biological 
Resource Center. Flowers were emasculated 1 day before crossing, and pools of 
20 1-cell/2-cell, 8-cell and ~32-cell embryos were hand-dissected approximately 
40, 64 and 78 h after pollination, respectively. RNA isolation, linear amplification 
of poly(A) RNA and strand-specific RNA-seq were as described’, except that 
embryos were extensively washed before RNA isolation. After removing adaptor 
sequences, reads were mapped to both the A. thaliana genome and transcript 
models (Col-0 genome reference, TAIR10) with the Bowtie short-read aligner”. 
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Reads were also mapped to a ‘pseudo’ Cvi-0 genome and Cvi-0 transcript models, in 
which SNPs in the Col-0 genome and transcript models were replaced with Cvi-0 
variants (ftp://ftp.arabidopsis.org/home/tair/Sequences/Polymorphism_datasets/ 
Ecker_Cvi_snps.txt) generated by the 1001 Genomes Project. Reads that both 
perfectly and uniquely matched either the genomes or transcript models were 
retained, and those overlapping transcribed regions of annotated genes were eval- 
uated for overlap with SNPs. Those diagnostic of one genome or the other were 
tallied after normalizing for differences in sequencing yield between samples. Tallies 
for multiple SNPs within the same gene were combined before calculating mater- 
nal-to-paternal ratios for each transcript. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Plant growth and embryo dissections. Col-0 and Cvi-0 seed stocks were obtained 
from the Arabidopsis Biological Resource Center. Plants were grown at 22°C ina 
Conviron growth chamber with a 16-h light/8-h dark cycle. Flowers were emas- 
culated 1 day before crossing, and pools of 20 1-cell/2-cell, 8-cell and ~32-cell 
embryos were hand-dissected approximately 40, 64 and 78h after pollination, 
respectively. 

Embryo dissections were performed in water on glass slides using a SZH stereo 
microscope (Olympus) with 128 magnification. Tungsten needles (Fine Science 
Tools) were used to cut open the seeds and remove the embryos from the rest of the 
seed material. For each embryo, a heat-pulled borosilicate glass microcapillary 
pipette (Harvard Apparatus) was used to move it to a depression slide containing 
50 pl water. The embryo was then washed two additional times by serial transfer to 
slides containing 50 11 water, using a new pipette and fresh water for each transfer. 
Washed embryos were collected in 30 pil of RNAlater (Ambion). 

RNA-seq and analyses. RNA isolation, linear amplification of poly(A) RNA and 
strand-specific RNA-seq were as described’, except that embryos were extensively 
washed before RNA isolation. After removing adaptor sequences, reads were 
mapped to both the A. thaliana genome and transcript models (Col-0 genome 
reference, TAIR10) with the Bowtie short-read aligner”’. Reads were also mapped 
to a ‘pseudo’ Cvi-0 genome and Cvi-0 transcript models, in which SNPs in the 
Col-0 genome and transcript models were replaced with Cvi-0 variants (ftp://ftp. 
arabidopsis.org/home/tair/Sequences/Polymorphism_datasets/Ecker_Cvi_snps.txt) 
generated by the 1001 Genomes Project. Reads that both perfectly and uniquely 
matched either the genomes or transcript models were retained, and those over- 
lapping transcribed regions of annotated genes were evaluated for overlap with 
SNPs. To normalize for differences in library sizes, the numbers of reads represent- 
ing each transcript were divided by the total number of reads matching the genome 
and transcript models. SNP-overlapping reads were assigned to one of the parental 
genomes and tallied for each transcript, combining tallies for multiple SNPs within 
the same transcript. 

Statistical analyses and graphics. Statistical analyses were performed and asso- 
ciated graphics were generated with the R statistical computing base package”. 
The SciPy tools for Python were used to calculate ’ test statistics and associated 
probabilities. 

Transgenic lines. The pBIN+LhG4-GW vector was generated by digesting both 
the pBIN+LhG4 vector and the attR1/attR2-containing vector GWRFa::pET42a 
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with KpnI and ligating together the appropriate digestion fragments. RPS5A 
and UBI3 promoter fragments were amplified with appropriate primer pairs, 
cloned into pENTR-D/TOPO (Invitrogen) and then recombined with 
pBIN+LhG4-GW to generate pRPS5A::LhG4-GW and pUBI3::LhG4-GW, 
respectively. The pV-TOP-GFP vector was generated by double-digesting 
pV-TOP(E3) and NLSGFPF1RI1::pCR8 with HindIII and Pmel, and ligating 
together the appropriate digestion fragments to replace the GUS reporter with a 
nuclear-localized GFP reporter. NLSGFPF1R1::pCR8 was created by amplifying 
pCGTAG with appropriate primers and cloning into pCR8/GW-TOPO 
(Invitrogen). pOp::GFP was then generated by digesting pV-TOP-GFP and 
GWFIR3::pENTR with Sall, and ligating together the appropriate digestion frag- 
ments. GWF1R3::pENTR was created by amplifying the attR1/attR2 Gateway 
cassette from GWRFa::pET42a with a series of overlap extension PCRs to remove 
internal Sall sites and add SalI sites on the ends of amplicons, which were then 
cloned into pENTR/D-TOPO (Invitrogen). Oligonucleotides used to generate the 
above constructs are listed in Supplementary Table 3. All constructs were trans- 
formed into Col-0 by Agrobacterium-mediated transformation. Because the 
progeny of T, lines gave more robust GFP signal than did progeny of established 
lines, T, lines were used for all reporter crosses. Embryos from crosses using at least 
13 pOp::GFP, four pRPS5SA::LhG4 and six pUBI3::LhG4 independent T, lines were 
examined. 

Microscopy. For confocal scanning laser microscopy, developing seeds were 
mounted in 50mM potassium phosphate buffer, pH 7.2, with 5% glycerol. A 
488-nm laser on a Zeiss LSM 510 confocal microscope was used to excite GFP 
and seed autofluorescence, and images were collected at 505-530 nm and 644— 
719 nm, respectively. 

Analyses of cleaved amplified polymorphic sequences. Transcripts containing a 
SNP that created or disrupted a restriction site in the corresponding cDNA were 
selected for analysis. Random hexamers (Invitrogen) were used for reverse transcrip- 
tion with Superscript III (Invitrogen) and primers flanking the SNPs were used for 
amplification (Supplementary Table 3). The amplified DNA was digested with the 
appropriate restriction enzyme (New England Biolabs; Supplementary Table 4). 
Digestion products were resolved on 2% agarose gels stained with ethidium bromide, 
and bands were quantified with Quantity One 1-D analysis software (Bio-Rad). 


30. R Development Core Team. R: A Language and Environment for Statistical 
Computing (R Foundation for Statistical Computing, Austria, 2010). 
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High-throughput decoding of antitrypanosomal 
drug efficacy and resistance 
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Mark C. Field’, Matthew Berriman? & David Horn! 


The concept of disease-specific chemotherapy was developed a 
century ago. Dyes and arsenical compounds that displayed 
selectivity against trypanosomes were central to this work’’, and 
the drugs that emerged remain in use for treating human African 
trypanosomiasis (HAT)’. The importance of understanding the 
mechanisms underlying selective drug action and resistance for 
the development of improved HAT therapies has been recognized, 
but these mechanisms have remained largely unknown. Here we use 
all five current HAT drugs for genome-scale RNA interference target 
sequencing (RIT-seq) screens in Trypanosoma brucei, revealing the 
transporters, organelles, enzymes and metabolic pathways that 
function to facilitate antitrypanosomal drug action. RIT-seq profil- 
ing identifies both known drug importers*’ and the only known pro- 
drug activator®, and links more than fifty additional genes to drug 
action. A bloodstream stage-specific invariant surface glycoprotein 
(ISG75) family mediates suramin uptake, and the AP1 adaptin 
complex, lysosomal proteases and major lysosomal transmembrane 
protein, as well as spermidine and N-acetylglucosamine bio- 
synthesis, all contribute to suramin action. Further screens link 
ubiquinone availability to nitro-drug action, plasma membrane 
P-type H*-ATPases to pentamidine action, and trypanothione 
and several putative kinases to melarsoprol action. We also demon- 
strate a major role for aquaglyceroporins in pentamidine and 
melarsoprol cross-resistance. These advances in our understanding 
of mechanisms of antitrypanosomal drug efficacy and resistance will 
aid the rational design of new therapies and help to combat drug 
resistance, and provide unprecedented molecular insight into the 
mode of action of antitrypanosomal drugs. 

African trypanosomes are transmitted by the tsetse insect vector and 
circulate in the bloodstream and tissue fluids of their mammalian hosts. 
These protozoan parasites cause HAT, also known as sleeping sickness, 
and the livestock disease known as Nagana. HAT is typically fatal if 
there is no chemotherapeutic intervention. The public health situation 
has improved recently with increased monitoring and chemotherapy 
averting more than 1.3 million disability-adjusted life years (DALYs) in 
the year 2000 and the estimated number of cases at less than 70,000 in 
2006 (ref. 7). However, therapies have many problems, including severe 
toxicity and increasing resistance, which is a major concern owing to 
the absence of a vaccine or therapeutic alternatives’. The current HAT 
therapies are pentamidine or suramin, which are only suitable for the 
first stage of the disease before central nervous system involvement, and 
eflornithine, nifurtimox or melarsoprol for advanced disease* (Sup- 
plementary Table 1). All of these drugs were developed well before 
the advent of molecular, target-based therapy and, with the exception 
of eflornithine, they elicit their antitrypanosomal effects by disrupting 
unknown targets. HAT treatment failure rates were reported to be 
increasing for suramin, when this drug was still in use in West Africa 
in the 1950s*, and melarsoprol treatment failure is a current and 
increasing problem’. 


We used genome-scale tetracycline-inducible RNA interference 
(RNAi) library screens in T. brucei to identify the genes that contribute 
to drug action. In these screens, replicating cells only persist in an 
otherwise toxic environment if knockdown confers a selective advantage 
(Fig. 1a); note that knockdown is not expected to identify drug targets. 
The RNAi library consists of ~750,000 clones, each transformed with 
one RNAi construct, and represents >99% of the approximately 7,500 
non-redundant T. brucei gene set. Because each gene is identified by an 
average of approximately five different RNAi sequences, true leads can 
be identified with high confidence and potential off-target false leads can 
be minimized (see Supplementary Methods). Screens were performed 
using all current HAT drugs and each yielded a population of cells 
displaying an inducible drug resistance phenotype after eight or fourteen 
days of selection (Fig. 1b and Supplementary Fig. 1). Genomic DNA 
from these cells was subjected to RIT-seq’® to create profiles of RNAi 
targets associated with increased resistance and to identify the genes that 
contribute to drug susceptibility. Genome-wide association maps show 
read density for 7,435 T. brucei genes (Fig. 1c). We defined genes with 
‘primary signatures’ as those associated with two or more independent 
RIT-seq tags, each with a read density of >99; the screens yielded 55 of 
these signatures (Fig. 1c; see Supplementary Methods and Supplemen- 
tary Data 1. Previous work linked the P2 adenosine transporter 1 (AT1) 
to melarsoprol uptake*’’”’, an amino acid transporter family member 
(AAT6) to eflornithine uptake*’*"* and a nitroreductase (NTR) to 
nifurtimox activation®"*. Each of these genes is identified on the appro- 
priate genome-wide association map (Fig. 1c), providing validation for 
our screens and indicating excellent genome-scale coverage in the 
RNAi library. Selected read-density signatures that establish new 
genetic links to drug susceptibility are shown in Fig. 1d. 

The known eflornithine transporter is the only primary signature 
from the eflornithine screen. By contrast, the suramin screen revealed 
28 genes associated with primary signatures (Fig. 1c and Supplemen- 
tary Data 1). Suramin, which has been used for HAT therapy since 
the 1920s", is a colourless sulphated napthylamine related to trypan 
red. Because this drug has a strong negative charge, it cannot cross 
lipid membranes by passive diffusion. Genes that are linked to the 
action of suramin encode ISG75, the function of which is unknown", 
four lysosomal proteins (the cathepsin L (CatL) and CBP1 peptidases, 
p67 and Golgi/lysosomal protein 1 (GLP1)), all four subunits of 
the adaptin complex (AP1), which are involved in endosomal, 
clathrin-mediated trafficking, and multiple spermidine and 
N-acetylglucosamine biosynthetic enzymes (Supplementary Fig. 2 
and Supplementary Data 1). 

Eight of these genes were selected for further analysis. We assembled 
multiple independent inducible RNAi strains for each gene and con- 
firmed that knockdown (Fig. 2a and Supplementary Fig. 3) increased 
suramin resistance in every case (Fig. 2b and Supplementary Fig. 4). 
We then determined subcellular localization for the putative major 
facilitator superfamily transporter (MFST); the tandem of three closely 
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Figure 1 | Identification of drug efficacy determinants in T. brucei. a, A 
schematic showing the RNAi library screening approach. The expected 
outcomes are given for RNAi targets that fail to affect drug resistance (black), 
increase resistance to drug A (blue), drug B (orange) or both (green). b, Each 
screen yielded a population displaying tetracycline (Tet)-inducible (RNAi- 
dependent) drug-resistance (see Supplementary Fig. 1). The plot indicates the 
proportion of the resistance phenotype that is tetracycline inducible. 

c, Genome-wide RIT-seq profiles. Each map represents a non-redundant set of 
7,435 protein-coding sequences. Red bars represent ‘primary’ read-density 
signatures. Black bars represent all other signatures of >50 reads (see 
Supplementary Data 1). All three expected ‘hits’, AAT6, ATI and NTR, are 
indicated. d, Selected signatures. Each peak represents a unique RIT-seq tag. 
‘+’, numbers of additional genes identified in each category. See 
Supplementary Fig. 2 for details and additional signatures. 


related MFST genes gave the strongest read-density signature in the 
suramin screen and the greatest half-maximum effective concentra- 
tion (ECs) increase (> tenfold) following knockdown (Fig. 2b). In 
contrast to a putative ubiquitin hydrolase (UBH1) identified by the 
screen, MFST and a member of the endomembrane EMP70 family 
partitioned into the T. brucei membrane fraction, as expected (Fig. 2c), 
and MEST localized to the lysosome as did the major lysosomal type I 
membrane glycoprotein, p67 (ref. 17), which was also identified in the 
screen (Fig. 2d). Because ISG75 trafficking is ubiquitin dependent'’, we 
investigated whether UBH1 influenced ISG75 expression. UBH1 
knockdown reduced ISG75 but not ISG65 expression (Fig. 2e), suggest- 
ing that de-ubiquitination by UBH1 specifically affects ISG75 copy 
number; clearly this mimics the direct effect of RNAi against ISG75. 
A vacuolar protein sorting factor, Vps5, which positively controls 
ISG75 expression”’, and a second putative ubiquitin hydrolase, were 
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Figure 2 | A network of proteins link ISG75, endocytosis and lysosomal 
functions to suramin action. a, Western blots demonstrate knockdown; 
Coomassie stains serve as loading controls. Tags, green fluorescent protein (GFP) 
and 12x MYC epitope (12M). See Supplementary Fig. 3 for growth curves. 

b, Endosomal and lysosomal factors and ISG75 contribute to suramin action. 
Error bars, s.d. from independent RNAi strains (see Supplementary Fig. 4). 

c, MFST and EMP70 are membrane associated. The western blots show 
supernatant (S), wash (W) and pellet (P; membrane fraction). d, MFST co- 
localizes with lysosomal protein, p67, but not recycling endosomes (Rab11). 
Dashed boxes, areas magnified in fluorescent images. e, Knockdown of UBH1 
specifically decreases ISG75 expression. f, ISG75 mediates suramin binding. Error 
bars, s.d. from duplicate experiments. P value from Student's f-test. ISG75 
knockdown is shown. Scale bar, 5 tum. g, The CatL—CatB, and ODC inhibitors 
FMK024 and eflornithine, respectively, antagonize suramin action. Isobolograms 
showing 50% fractional inhibitory concentrations (FICs). The solid lines indicate 
antagonism. The dashed lines indicate expected outcomes for no interaction. 


also identified by the screen (see Supplementary Fig. 2 and Supplemen- 
tary Data 1), suggesting that ISG75 copy number is highly connected to 
suramin resistance. To investigate whether ISG75 contributes to 
suramin binding, we performed whole-cell binding assays using 
>[H]-labelled suramin. Cells that were depleted for ISG75 displayed 
significantly and specifically reduced suramin binding (Fig. 2f). 

We observed a greater than fourfold increase in ECs after knock- 
down of the CatL-like protease known as brucipain, another abundant 
lysosomal protein”, and an orthogonal assay using a dual-specificity 
CatL—CatB inhibitor revealed inhibitor antagonism (Fig. 2g), indicating 
that protease activity enhances suramin toxicity. Taken together, the 
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results demonstrate a central role for lysosomal functions in suramin 
action. As four enzymes that are involved in spermidine biosynthesis, 
including ornithine decarboxylase (ODC), were linked to suramin 
action (Supplementary Data 1), we used eflornithine to specifically 
inhibit ODC, which again revealed inhibitor antagonism (Fig. 2g; Sup- 
plementary Table 1). Thus, ODC activity enhances suramin toxicity, 
probably through spermidine biosynthesis. Suramin endocytosis”' and 
intralysosomal accumulation” have previously been demonstrated in 
T. brucei and an acquired suramin resistance phenotype was stable in 
bloodstream stage T. brucei but was not expressed in the insect stage”. 
The RIT-seq profile reported here, bloodstream-stage-specific expres- 
sion of ISG75"° and strong downregulation of endocytic and lysosomal 
activities in the insect stage’, are all consistent with stage-specific, 
intralysosomal accumulation of suramin. 

Work with dyes and arsenicals revealed the first examples of resist- 
ance to chemotherapy a century ago and, based on cross-resistance, it 
was deduced that there are shared mechanisms contributing to the 
action of certain ‘parasitotropic’ compounds’. Among current HAT 
therapies, cross-resistance has been documented only for melarsoprol 
and pentamidine’, but our understanding of the mechanism remains 
incomplete. Both drugs enter trypanosomes through the P2 AT1 but 
additional, dual-specificity transporters are predicted’. To identify cross- 
resistance mechanisms, we analysed all pair-wise comparisons among 
our screens (Fig. 3a). A single robust signature emerged, implicating 
two closely related aquaglyceroporins (AQPs)”* in melarsoprol and 
pentamidine cross-resistance. To directly test the role of the AQPs, 
we generated a strain that was deficient in aqp2 and aqp3 (aqp2/aqp3- 
null strain) (Fig. 3b). The ECso was increased more than 2-fold and 15- 
fold for melarsoprol and pentamidine, respectively, in aqp2/aqp3-null 
cells compared to wild-type cells (Fig. 3c). Our favoured hypothesis 
involves regulation of dual-specificity transporters by AQPs. 

The nifurtimox, pentamidine and melarsoprol screens yielded eight, 
nine and nine genes associated with primary signatures, respectively. 
The major primary signature in the nifurtimox profile identified the 
mitochondrial, flavin-dependent nitroreductase that activates this 
class of nitro pro-drugs®. We also identified the putative flavokinase 
that converts riboflavin to FMN, an essential nitroreductase cofactor’. 
Four additional signatures identified genes that encode proteins linked 
to ubiquinone biosynthesis (Supplementary Fig. 2 and Supplementary 
Data 1), in support of the hypothesis that nitroreductase, like NADH 
dehydrogenases, transfers electrons from NADH to ubiquinone to 
generate ubiquinol®. We assembled RNAi strains for one of these 
factors and demonstrated that knockdown increased the ECs, for 
nifurtimox by approximately 1.5-fold (Supplementary Fig. 5). Thus, 
six gene signatures support a dominant role for nitroreductase in 
nifurtimox activation and suggest that this is dependent upon the 
availability of the FMN cofactor and the natural substrate. 

Pentamidine is an aromatic diamidine, a nucleic acid binding 
drug that accumulates to millimolar concentrations and collapses 
trypanosome mitochondrial membrane potential’®. Two primary sig- 
natures from the pentamidine screen identify genes encoding P-type 
ATPases (Supplementary Fig. 2 and Supplementary Data 1), and one 
of these represents the plasma membrane H*-ATPases, HA1, HA2 
and HA3 (ref. 27). We assembled RNAi strains for these ATPases and 
demonstrated that knockdown increased the ECso for pentamidine 
more than eightfold (Supplementary Fig. 5), suggesting that an 
HA1-3 dependent proton motive force is required to drive pentamidine 
uptake. We used a similar approach to demonstrate a greater than 
twofold increase in the ECso for pentamidine following knockdown 
of a putative protein phosphatase (Supplementary Fig. 5). 

Melarsoprol acts primarily by forming a stable adduct with trypa- 
nothione, known as Mel T?’, but whether this adduct reduces or 
increases toxicity has remained unclear. The melarsoprol screen iden- 
tified a link to trypanothione synthase and trypanothione reductase 
(Supplementary Fig. 2 and Supplementary Data 1), suggesting that the 
Mel T adduct is toxic. Three other primary signatures identified an 
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Figure 3 | agp2/aqp3-null cells are melarsoprol, pentamidine cross- 
resistant. a, Analysis of read density for all (74,350) possible pair-wise 
comparisons of a non-redundant T. brucei gene set. E, eflornithine; M, 
melarsoprol; N, nifurtimox; P, pentamidine; S, suramin; X and Y, axes 
representing each data set. The box on the right shows the read-density 
signatures for this locus (Tb927.10.14160-70). b, AQP2 and AQP3 knockout 
was confirmed by Southern blot analysis. A, the region deleted; S, SacI; WT, 
wild type. Bars indicate probes. c, ECso analysis indicates melarsoprol, 
pentamidine cross-resistance in aqp2/aqp3-null cells. Error bars, s.d. from 
triplicate assays and independent null strains. 


over-representation (P = 2.3 X 10~°, x? test) of putative protein 


kinases (Supplementary Fig. 2 and Supplementary Data 1), and 
another signature identified a gene encoding a highly phosphorylated 
protein related to the amino-terminal segment of the large tumour 
suppressor, LATS1 (see Supplementary Fig. 2a). We used independent 
strains to confirm that LATS1-like knockdown increased the ECs9 for 
melarsoprol by approximately 1.5-fold (Supplementary Fig. 5). On the 
basis of these signatures, we suggest a role for a signalling cascade in 
melarsoprol susceptibility. Our findings are summarized in Fig. 4. In 
particular, we propose that suramin uptake occurs through ISG75- 
mediated endocytosis (Fig. 4a). Metabolic pathways that contribute 
to suramin or nifurtimox action are detailed in Fig. 4b. 

All but one of the current HAT drugs was developed in the absence 
of an understanding of the chemical-biological relationships under- 
lying toxicity or selectivity. Our RIT-seq profiles revealed more than 50 
T. brucei genes that enhance drug susceptibility, unearthing inter- 
actions that are largely inaccessible using other approaches. Notably, 
the knockdown approach and the sensitivity of RIT-seq allow access 
to essential proteins, complexes and pathways such as H*-ATPase, 
the adaptin complex and spermidine biosynthesis. Our results also 
show the utility of drugs as molecular probes for functional networks. 
In particular, the findings highlight factors that contribute to drug 
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Figure 4 | Determinants of drug efficacy in African trypanosomes. 

a, b, Proteins (red) and metabolites (green) that are linked to drug action. a, A 
schematic summarizing the findings from the RIT-seq screens. In the case of 
suramin, we propose that ISG75 binds the drug at the cell surface. ISG75 
trafficking then delivers the complex, through the flagellar pocket (FP), to the 
endosomal system, leading to accumulation in the lysosome where the drug is 
liberated by proteases. The MFST may deliver the drug to the cytosol. HAPT, 
high-affinity pentamidine transporter; LAPT, low-affinity pentamidine 
transporter; TS,, oxidised trypanothione; T[SH],, reduced trypanothione; 
UQ3, ubiquinone 9. b, Biosynthetic pathways that are linked to drug action. See 
Supplementary Data 1 for definitions and further details. 


accumulation or the generation of toxic metabolites, features that could 
be exploited to deliver or generate novel toxins. Additionally, absence or 
loss of function could explain innate or acquired resistance; suramin 
resistance or melarsoprol and pentamidine cross-resistance may be due 
to reduced MFST or AQP expression, respectively (for examples, see 
Supplementary Fig. 6). These advances in our understanding of drug- 
trypanosome interactions will facilitate rational approaches to the 
design of more efficacious and durable therapies, and will be useful 
for monitoring the emergence and spread of resistance. 


METHODS SUMMARY 


Assembly of the bloodstream-form T. brucei RNAi library and RIT-seq were 
reported previously'®. Briefly, a tetracycline-inducible RNAi plasmid library, 
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containing randomly sheared genomic fragments (with a mean fragment size of 
~600 bp) under the control of head-to-head, tetracycline-inducible phage T7 
promoters”, was targeted to a single genomic locus that had been validated for 
robust expression”’. The long double-stranded RNAs (dsRNAs) that were generated 
in the presence of tetracycline are processed to produce a pool of short interfering 
RNAs that programme the endogenous RNAi machinery to mediate sequence- 
specific destruction of the cognate messenger RNA. For this study, the library was 
grown under inducing conditions with drug selection, and genomic DNA was iso- 
lated from surviving populations. For RIT-seq profiling, adaptor-ligated sequencing 
libraries were prepared from each genomic DNA sample and used to amplify DNA 
fragments containing RNAi cassette-insert junctions in semi-specific PCR reactions; 
one primer was specific for the RNAi vector and the other for the Illumina adaptor. 
Size-selected DNA was sequenced with 76 cycle runs on an Illumina GAIL. 
Sequencing reads containing a nine-base RNAi cassette-insert junction sequence 
were then mapped to the T. brucei reference genome. In cases in which loss of 
function increases drug tolerance, RNAi-target sequence representation is increased 
relative to the otherwise susceptible population, revealing ‘hot spots’. Thus, RNAi 
target fragments serve as templates for the production of dsRNA and also provide 
unique sequence identifiers for each clonal population. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


T. brucei growth and drug selection. The bloodstream-form T. brucei MITat 1.2 
clone 221a RNAi library’? was derived using the randomly sheared genomic 
fragment (with mean fragment length ~600 bp) RNAi plasmid library’. The 
T. brucei RNAilibrary and 2T1 cells were maintained as described”. For selective 
screens, the RNAi library, maintained throughout at >5 x 10° cells, was induced 
with tetracycline (1 pg ml‘) for 24 h and then grown in medium containing 
tetracycline, plus each HAT drug at 0.5 X ECso to 3.5 X ECs (Supplementary 
Table 1 and Supplementary Fig. 1). All drug stocks were in dimethylsulphoxide. 

RIT-seq. Selected populations from each screen were assessed for tetracycline- 
dependent drug resistance. The RNAi target fragments provide unique identifiers 
for each clone in the population. As a quality-control step, PCR amplification, 
agarose gel fractionation and Sanger sequencing of the eluted products were 
performed as described", and followed with RIT-seq analysis'®. All nine genes 
that were identified by Sanger sequencing were associated with high-density 
Illumina read-counts (13,000 to 528,000; see Supplementary Data 1a). Briefly, 
we ran 76-cycle sequencing on an Illumina GAII; this generates sequence tags 
derived from the ends of the RNAi target fragments. Only sequences containing a 
terminal RNAi-vector junction sequence (GCCTCGCGA) were mapped to the 
T. brucei 927 reference genome” using the SSAHA sequence alignment algorithm*’. 
After mapping, for each protein coding sequence (CDS) in each experiment, we 
obtained a count of reads mapping; all genes associated with >9 reads are detailed 
in Supplementary Data 1b. We also browsed all read-density plots in Artemis”? for 
signatures that fell outside of CDSs to generate the full non-redundant ‘hit list’ 
detailed in Supplementary Data 1a. 

Read-density signatures. Genome coverage in the current RNAilibrary represents 
>99% of all genes, with 5 RNAi targets per gene on average’; shorter genes are 
expected to be represented by fewer RNAi targets. Our screens yielded 5-59 genes 
(0.07-0.8%) with a >RIT-seq tag (a tag with a read density of >99; the eflornithine 
screen yielded 5, the suramin screen 59 the nifurtimox screen 54, the pentamidine 
screen 17 and the melarsoprol screen 19). In each screen, at least one gene was 
associated with a ~°°°°RIT-seq tag (Supplementary Data 1a). From this set, we 
derived 55 genes with ‘primary signatures’, those associated with two or more 
>°RIT-seq tags. If these tags were randomly distributed, we would expect a single 
primary signature from 300 screens using eflornithine or from two screens using 
suramin, assigning a high degree of confidence to the vast majority of observed 
primary signatures (Supplementary Data 1a). The nifurtimox output is unusual 
compared to the other outputs and may reflect drug-mediated mutagenesis”; for 
example, inactivating mutations within NTR may prolong the survival of clones 
carrying unrelated RNAi targets. However, even limited tetracycline-regulated drug 
resistance (Fig. 1b) and a high number of sequence tags in the nifurtimox screening 
profile (Supplementary Data 1 and Fig. 1c) had little impact on primary signature 
confidence. Many of the 130 genes that are associated with ‘secondary signatures’ in 
Supplementary Data la may also reflect mechanisms of drug action, but here we 
only considered seven of these genes that were linked to a common function with a 
primary hit (Supplementary Fig. 2). We observe that, on average, 3.5 tags per gene 
are associated with the 24 primary, single copy genes that are shown in Sup- 
plementary Fig. 2. Minimal library propagation could explain a modest reduction 
in coverage but we suggest that reduced coverage in the current RIT-seq outputs is 
primarily explained by major fitness defects following knockdown. 

Plasmid construction and strain assembly. The AQP locus was disrupted by 
replacement of a 4,772-bp (AQP2 and AQP3 ) fragment with NPT and BLA 
selectable markers (the T. brucei genome is diploid). Gene-specific RNAi frag- 
ments of 400-600 bp or 200 bp, to facilitate moderate knockdown in the case of the 
known essential gene p67 (ref. 17), were amplified using PCR primers designed 
using RNAit* and cloned into pRPaiSL for the generation of stem-loop, ‘hairpin’ 
dsRNA as the trigger for RNAi**. We used a long, 400-600-bp RNAi target 
fragment for CatL because RNAi previously produced no growth defect*’. 
However, cells retained 35% CatL activity in that study”, probably explaining 
why we see a major growth defect when expressing a more potent stem-loop 


dsRNA (Supplementary Fig. 3). For epitope tagging at native loci, C-terminal 
fragments, or an N-terminal fragment (UBH1), were amplified and cloned in 
pNATx"“S and pNAT™®x (ref. 36), respectively. Constructs were introduced 
into 2T1 cells as described”®. Full oligonucleotide details are available on request. 
Strain analysis. Cumulative growth curves were generated from cultures seeded 
at 10° cells ml, counted ona haemocytometer and diluted back to 10° cells ml~! 
as necessary. For ECs9 assays, RNAi strains were pre-induced for 72 h in 
1 pg ml | tetracycline, except CatL and AP1B, which were pre-induced for 
24 h at 2.5 and 1 ng ml |, respectively. Isobolograms were generated using a 
checkerboard assay as described**; FMK024 (N-morpholineurea-phenylalanyl- 
homophenylalanylfluoromethyl ketone; Sigma) is an irreversible, dual-specificity 
inhibitor of CatL and CatB. All ECs» assays were carried out using alamarBlue 
as described'**’. Southern blotting was carried out according to standard proce- 
dures*®. Subcellular fractionation by hypotonic lysis was carried out as described*’. 
All protein samples were stored in the presence of a protease inhibitor cocktail 
(Roche) and were not boiled. Whole-cell lysates and hypotonic lysis fractions were 
separated by SDS-PAGE using standard protocols”. Immunofluorescence was 
carried out as previously described’®. We used specific antisera to detect ISG75 
(ref. 42), p67 (ref. 43), CatL'’, GLP1 (ref. 44) and AP1y (ref. 45), and anti-MYC or 
anti-GFP antisera were used to detect tagged versions of MFST, UBH1 and 
EMP70. To assess suramin binding, cells were collected at mid-log phase and 
resuspended at 10’ ml! in 35 nM *[H]-suramin (Hartmann Analytic; pre- 
incubated for 16 h in complete HMI11) at 37 °C. Cells were washed in ice-cold 
PBS, resuspended in 100 pl Optiphase Supermix scintillant (Perkin Elmer) and 
°[H]-suramin incorporation quantified using a 1450 Microbeta scintillation 
counter (Perkin Elmer). 
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Conditional modulation of spike-timing- 
dependent plasticity for olfactory learning 


Stijn Cassenaer!? & Gilles Laurent)? 


Mushroom bodies are a well-known site for associative learning in insects. Yet the precise mechanisms that underlie 
plasticity there and ensure their specificity remain elusive. In locusts, the synapses between the intrinsic mushroom 
body neurons and their postsynaptic targets obey a Hebbian spike-timing-dependent plasticity (STDP) rule. Although 
this property homeostatically regulates the timing of mushroom body output, its potential role in associative learning is 
unknown. Here we show in vivo that pre-post pairing causing STDP can, when followed by the local delivery of a 
reinforcement-mediating neuromodulator, specify the synapses that will undergo an associative change. At these 
synapses, and there only, the change is a transformation of the STDP rule itself. These results illustrate the multiple 
actions of STDP, including a role in associative learning, despite potential temporal dissociation between the pairings that 
specify synaptic modification and the delivery of reinforcement-mediating neuromodulator signals. 


Behavioural and genetic experiments in Drosophila and honeybees 
have revealed that the mushroom body, a brain area containing up to 
hundreds of thousands of neurons called Kenyon cells, is critical for 
associative learning of odours’ but not for the expression of innate 
odour-driven behaviours’ (Fig. la). Recent electrophysiological 
experiments in locusts and other insects show that the responses of 
Kenyon cells to odours are highly selective and, thus, rare’*"*. By 
contrast, antennal lobe neurons, the source of the olfactory input to 
Kenyon cells, are few and promiscuous’*. Odour codes are thus 
‘compact’ in the antennal lobe—that is, the representation of each 
odour engages many neurons in a small population—but ‘sparse’ in 
the mushroom bodies (Fig. 1a). Although sparse codes require larger 
neuron populations, they are beneficial for memory because they can 
reduce interference between memory traces'*’*'®. Kenyon cells 
project to two regions, called %- and B-lobes, where they synapse onto 
small populations of ‘extrinsic’ neurons. In locusts, the synapses 
between Kenyon cells and B-lobe neurons (bLNs) are modifiable by 
a Hebbian STDP rule’”"° (Fig. 1a), but nothing so far implicates STDP 
in associative learning there; rather, STDP causes the homeostatic 
regulation of bLN spike timing'’. Recent experimental results in 
moths" show that Kenyon cell responses to odours recorded during 
behavioural learning generally occur and end well before reward 
delivery, indicating that STDP alone cannot support associative con- 
ditioning”’*'. Neuromodulation has been proposed recently as a 
potential solution’. We address this issue with in vivo electro- 
physiology in locusts and discover a complex interplay between 
STDP, reinforcer signals and odour codes in mushroom bodies. 


Dense odour representations in B-lobes 


We first examine odour representations in the f-lobes. Using 
intradendritic recordings, we sampled the responses of 55 bLNs to 
up to 16 odours (Methods and Fig. 1b). Each bLN responded to nearly 
every odour, with responses that differed in intensity, patterning, delay 
and duration across neuron-odour pairs. The probability that a given 
bLN responded to an odour was 0.97. On average, a bLN fired action 
potentials in approximately half of the local field potential (LFP) oscil- 
lation cycles during the odour presentation (0.502 + 0.219, n = 126 


neuron-odour pairs, 25 LFP cycles per response): on average, half of 
the population was active in any given cycle. Hence, these representa- 
tions resemble those in the antennal lobes’* (and exceed them in 
promiscuity) rather than the sparse representations by Kenyon cells, 
to which bLNs are directly connected. 

We tested whether the broad tuning of bLNs might be explained by 
known features of mushroom body circuits. We implemented a simple 
model (Methods and ref. 17) constrained by Kenyon cell response 
statistics and timing’’, by the properties of STDP at Kenyon cell 
synapses’” and by Kenyon cell/bLN (KC-bLN) connectivity ratios 
estimated from experiments’’. With such a model, we could reproduce 
the bLN firing phase observed experimentally’” (Fig. 2a, left) but not 
the odour-response intensity or probability (Fig. 2a, right, and Sup- 
plementary Fig. 1.1a). Rather, activity across the model bLN (mbLN) 
population saturated rapidly when STDP was turned on. This beha- 
viour is a known property of rate-based Hebbian learning in model 
networks and can be counteracted by imposing synaptic weight 
bounds or renormalization rules”. 


Inhibition limits STDP and saturation 


In examining our experimental data on bLN responses to odours, 
we observed that the periodic excitatory input originating from 
Kenyon cells was often curtailed by phasic inhibitory postsynaptic 
potentials (Fig. 2b, arrowheads), with onsets at the typical phase of 
bLN action potentials. We hypothesized that these inhibitory post- 
synaptic potentials originate from lateral inhibition among bLNs (as 
put forward in a theoretical exploration’’). This was supported by 
extracellular stimulation of Kenyon cells (Supplementary Fig. 2) 
and confirmed in paired bLN recordings. Beta-lobe neurons inhibit 
each other (Fig. 2c) with an estimated connection probability of 28% 
(n = 32 connections, 2 reciprocally connected pairs). Although 
unknown so far, other interneuron populations may also contribute 
to bLN inhibition. 

In each oscillation cycle, lateral inhibition reduces the likelihood of 
late bLN spikes; thus, it should limit the ability of STDP to potentiate 
KC-DbLN synapses. This in turn should curb bLN population activity 
(see also ref. 26). As predicted, implementing lateral inhibition 
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Figure 1 | Beta-lobe neurons are promiscuous. a, Olfactory circuit in which 
bLNs are embedded. Ant., antennal; lat., lateral. b, Rasters constructed from in vivo 
intracellular dendritic recordings, illustrating responses of four bLNs (different 
animals) to ten odours. Shaded region represents odour delivery (1s). For each 
bLN, intracellular traces at bottom correspond to the tenth trial with the last odour. 


between mbLNs (based on the experimental connectivity estimate 
and amplitude distribution) reduced population activity to ~50% 
(Presponse = 0.51 0.14 (response probability), n=30 mbLNs; 
compare With Pyesponse = 0.49 + 0.18, n = 30 bLNs from experiment; 
Fig. 2d-f). The mbLN spike phase remained centred on m (LFP 
trough). In addition, the response profiles of those mbLNs were very 
similar to those recorded experimentally (compare distributions in 
Fig. 2e, f and moments in Fig. 2g; see also Supplementary Figs 3-5). 
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Horizontal scale bars, 500 ms; vertical scale bars, 10 mV; spikes are clipped. 
Odours: 2-heptanone (2hp), cherry (che), cis-3-hexen-1-ol (chx), 3,7-dimethyl- 
2,6-octadiene-nitrile (don), 1-heptanol (hpo), 1-hexen-3-ol (hx3), 1-hexanol 
(hxo), mint (min), nonanal (nna), 5-nonanone (nnn), octanal (oca), 1-octanol 
(oco), 3-pentanone (pnn), trans-2-hexen-1-ol (thx), 6-undecanone (unn). 


We conclude that bLN activity, although very different from the 
sparse output of Kenyon cells, can be explained by the combined 
effects of Hebbian STDP at KC-bLN synapses and lateral inhibition 
between bLNs. The useful consequences are that bLN spikes are 
tightly phase-locked to the LFP’” and that instantaneous bLN popu- 
lation activity lies around the midpoint of its dynamic range. This 
increases the coding capacity of this small population of output 
neurons and is relevant for plasticity, as explored below. 
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Figure 2 | Saturation of activity caused by STDP can be counteracted by 
lateral inhibition. a, Mean firing phase (left) and normalized population 
activity (right) in a simulation of a KC-bLN network (30 Izhikevich units“, one 
LFP cycle, 1,000 trials). Each point is a ten-trial average; STDP is turned on in 
trial 11. Each curve is run with n Kenyon cell inputs per mbLN (20 <n < 80, 
10% connections in common). STDP regulates bLN firing phase (ref. 17) but 
inexorably drives bLN population to saturation. b, In vivo intradendritic 
recording of bLN membrane potential, V,,, together with LFP during odour 
presentation. Arrowheads indicate onsets of inhibitory postsynaptic potentials, 
timed on LFP troughs. c, Top left: frequency distribution of inhibitory 
postsynaptic potential amplitudes, measured at baseline, in 30 bLNs; black 
curve, fit used for model in d. IPSP, inhibitory postsynaptic potential. Bottom 
left: paired bLN recording revealing direct inhibitory connection. Right: bLN 
spike-triggered averages from seven distinct bLN pairs. d, Simulation of a bLN 
network as in a, except with n Kenyon cell inputs per bLN (here 22 <n < 130, 
same drive at simulation onset as in a). Lateral inhibition among bLNs is as 
determined experimentally. Mean phase (left) is minimally affected; network 
activity (right) settles near midrange. e, Simulation of a bLN network as in 

d, extended over 25 LFP cycles, with time-varying Kenyon cell input profiles 
derived from experiments'*’”*>. Post-stimulus time histograms plotted as firing 
probability versus cycle rank for each of the 30 mbLNs (ten trials), post STDP. 
f, In vivo intradendritic responses of 30 bLNs to trans-2-hexen-1-ol represented 
as in e (from 16 different animals). g, Comparison of experimental (black) and 
model (grey) bLN response statistics. Same simulation as in e (darkest grey) 
together with two other simulations of the same network with different Kenyon 
cell input profiles (lighter shades of grey). 


Conditional modulation of STDP rule 


Mushroom bodies are critical for associative conditioning in 
insects®''’’, and second-messenger pathways involved in memoriza- 
tion and retrieval have been described in Drosophila’*’**”°. Yet we 
know little about the synaptic mechanisms underlying conditioning 
there*’, and even less about how specificity is achieved or about the 
roles that STDP might have in it. Associative learning in insects seems 
to rely on neuromodulators*’~ (for example octopamine (OCT) and 
dopamine) to mediate unconditioned signals (appetitive and aversive, 
respectively, for OCT and dopamine). In locusts and other insects, 
these neuromodulators are delivered nonspecifically by small numbers 
of octopaminergic*® and dopaminergic” neurons with projections to 
the mushroom body lobes (among others), that is, where Kenyon cells 
contact bLNs. 
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We assessed the effect of local OCT injection into the B-lobes on 
plasticity and tuning of those bLNs in vivo (Fig. 3). Figure 3d illus- 
trates our experimental schemes, which were designed to probe spe- 
cificity, that is, the question of how the system determines which 
synapses should change when the reinforcer is global. During an 
odour stimulus, two features distinguish odour-evoked Kenyon cell 
spikes from ‘stray’ spikes produced randomly by non-responding 
Kenyon cells (most of the Kenyon cells): odour-evoked spikes are 
synchronized and cause bLNs to spike by their collective action. 
Hence, odour-evoked release at Kenyon cell synapses occurs within 
the time window for STDP. We thus tested the effect of OCT injection 
on synapses that differed only in their temporal features of pre- and 
postsynaptic activation. In each experiment, we measured the res- 
ponses of one bLN to Kenyon cell stimulation at a rate of 0.18’ at 
two locations (a pairing location (P) and a control location (C)) and 
two different times (C 300 ms before P). We found Kenyon cell stimu- 
lation sites that converged onto the same recorded bLN in 13 out of 20 
experiments. 

In ‘test’ trials (Fig. 3d, top), the bLN was depolarized by a brief pulse 
of current to produce one single action potential 5 s after Kenyon cell 
stimulation at P, hence well outside the window of dt = fthost — tpre = 
+25 ms required for STDP”, where fpost and tpre are respectively the 
post- and presynaptic spike times. Pairing consisted of five trials at 
0.1s 1, of either of two kinds. In ‘STDP’ trials, the bLN was fired 
within +25 ms of a Kenyon cell stimulus at P (Fig. 3d, middle). In 
‘STDP + OCT’ trials, pairing was as in STDP trials but was followed 
by one OCT injection (5 pl) 1s after pairing (Fig. 3d, bottom). Test 
trials were carried out before STDP trials (Fig. 3a—c, red), after STDP 
trials (black) and after STDP + OCT trials (blue). All test and pairing 
trials thus contained the same numbers of pre- and postsynaptic 
pulses; only their relative timings differed. Examples with three values 
of dt are shown (Fig. 3a-c). In each, OCT caused a depression. We 
note that only connections between cells co-activated within the 
STDP window were depressed (that is, P but not C; see also 
Supplementary Fig. 6). The depressive action of OCT on synapses 
undergoing STDP (20 different values of dt Fig. 3e, f and 
Supplementary Fig. 7) was greatest for dt > 0, converting baseline 
Hebbian STDP into one where only depression occurred. 

To rule out the possibility that the action of OCT resulted not from 
the pairing per se, but from the temporal proximity of OCT delivery to 
the pairing (P), we reversed the order of C and P (m= 3; Sup- 
plementary Fig. 8). Again, only the P pathway was affected by en 
(effect of OCT on control (mean + s.e.m.): —0.9 + 0.49%, n= 
(P then C); 0.2 + 2.28%, n = 10 (C then P); sets statistically sea 
P> 0.66, t-test). Hence, because the control and paired synapses 
shared the same postsynaptic partner (C and P differed only in the 
identity of the presynaptic cells and in the timings of the presynaptic 
spikes relative to those in their common bLN), and because the two 
sets of presynaptic cells were stimulated within only 300 ms of one 
another (C then P or P then C), we conclude that the relative timing of 
pre- and postsynaptic firing is what contributes to making the 
synapses susceptible to OCT action. We do not know whether pre-post 
delays longer than +25 ms (but shorter than 300 ms) would have equi- 
valent effects. However, almost all odour-evoked Kenyon cell action 
potentials occur within the range of delays we tested. We conclude that, 
despite its global release, OCT affects specific synapses—those in which 
pre- and postsynaptic partners fired within the appropriate delay 
between one another. This is the case even though OCT was delivered 
1s after the relevant pairing (reproducing a temporal dissociation 
between conditioned stimulus and unconditioned stimulus”). 


Conditional changes of STDP are specific 


The STDP curves in Fig. 3f were obtained in vivo with electrical 
activation of the pre- and postsynaptic cells. We next tested the effect 
of OCT on the responses of bLNs to odours. We predicted that if OCT 
depresses KC-bLN synapses that fire within the STDP time window, 
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Figure 3 | OCT changes the STDP rule. a, Five trials pairing OCT with 
depressing STDP (6t< 0) reduced KC-bLN excitatory postsynaptic potential 
(EPSP) size (blue) more than did two sets of five-trial STDP with negative dt 
alone (black). EPSPs evoked by interleaved stimulation with the control 
electrode (embedded among Kenyon cells in a different location) were 
unaffected (control, bottom right), despite equal exposure to OCT. b, Five trials 
pairing OCT with 6t~ 0 STDP reduced KC-bLN EPSP size (blue), whereas 
STDP alone with this dt value had no effect (black). Control EPSPs were 
unaffected (bottom right). c, Five trials pairing OCT with potentiating STDP 
(St > 0) reduced KC-bLN EPSP size after it was potentiated by two bouts of 
STDP alone with this 5¢ value. Control EPSPs were unaffected (bottom right). 
d, Stimulation protocols. Test trials were used to evaluate the amplitudes of 
KC-bLN EPSPs before STDP, after STDP and after STDP + OCT, as shown in 
a-c, and to compute (from multiple experiments) values in e and f. C, control 
Kenyon cell input; P, paired Kenyon cell input. In test trials, bLNs were 
depolarized with a 5-ms d.c. pulse (causing a single bLN spike) ~5 s after 


it should also reduce bLN output. This was tested first with our model, 
using an STDP rule modified according to Fig. 3f: responses were 
reduced in 28 of 30 mbLNs (Fig. 4a, b). We then tested the prediction 
experimentally with 12 bLNs (18 bLN-odour pairs), each recorded 
intracellularly from a dendrite. In 16 of 18 instances, OCT caused a 
reduction of bLN output, averaged over the odour response (Fig. 4c, 
d). LFP power in the mushroom body calyx was unaffected (Fig. 4f), 
consistent with unchanged input to the mushroom body. 

We also evaluated the specificity of changes in bLN-odour res- 
ponses. A bLN was presented with three odours in blocks of trials 
in the order indicated (Fig. 4e, left). OCT was injected 800 ms after 
odour onset, paired each time with only one of the three odours. As in 
Fig. 4c, d, the responses of the bLN to the three odours (assembled by 
odour in three graphs; Fig. 4e, right) were reduced after pairing with 
OCT (compare blue and red post-stimulus time histograms). The 
reduction of the response to an odour occurred only after pairing of 
that particular odour with OCT. For example, responses to cherry and 
trans-2-hexen-1-ol were minimally affected by pairing OCT with 
2-heptanone; they became significantly reduced only after the appro- 
priate pairing of either cherry or trans-2-hexen-1-ol with OCT. 
Figure 4f compares summary data for odours paired with OCT 
(n= 18 bLN-odour pairs) to controls (n = 12 bLN-odour pairs: 
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extracellular stimulation (0.1-ms pulses) of Kenyon cells in two different 
locations (C and P, 300 ms apart). In STDP trials, bLNs were depolarized as 
above, causing a single spike in a narrow window around the time of Kenyon 
cell stimulation (P). Control Kenyon cell stimulation (C) was offset by 300 ms. 
STDP + OCT trials were the same as STDP trials, only with the addition of 
localized OCT injection (50-ms pressure-pulse) in the B-lobe, 1s after P 
Kenyon cell stimulation. e, Summary data from 20 experiments, plotting for 
each experiment the effect of STDP + OCT versus STDP alone (blue dots) and 
plotting control data, with and without OCT, for 13 of 20 experiments (black 
circles). Changes due to STDP + OCT differ significantly from control + OCT 
(P < 0.003, t-test). Changes due to OCT in control not significantly different 
from zero (P > 0.84, t-test; see also Supplementary Fig. 7). f, Comparison 
between change due to STDP + OCT and change due to STDP alone, each 
plotted as a function of dt. Changes due to STDP + OCT differ significantly 
from zero (relative to before STDP: P < 0.0001; relative to STDP only: 
P<0.0002, t-test; see also Supplementary Fig. 7). 


odour responses after pairing of a different odour with OCT (n = 7) 
and odour responses after pairing of electrical Kenyon cell stimula- 
tion with OCT (n= 5); n= 18 LFP recordings simultaneous with 
intracellular bLN recordings). These results support the observation 
that OCT specifically depresses contacts between cells co-activated 
within the STDP window (Fig. 3); they also indicate that KC-bLN 
pairing and OCT should co-occur within an interval of at most a few 
tens of seconds. 


Discussion 

In the absence of OCT, STDP at KC-bLN synapses adaptively regu- 
lates the bLN spike phase’’ and bLN population output, helped by the 
action of lateral inhibition between bLNs (Fig. 2). In the context of 
reinforcer delivery, STDP gains a new role: it tags—presumably 
through biochemical changes dependent on pairing—the synapses 
between connected Kenyon cells and bLNs that have fired an action 
potential within the STDP pairing window, itself commensurate with 
the oscillation period in this system**. This renders those synapses 
susceptible to the action of OCT (which is proposed to mediate positive 
reinforcement in insect learning'***”°) and reduces the inhibitory out- 
put of the mushroom body mediated by bLNs. Hence, the action of 
OCT can be specific, even though its release is diffuse and often delayed 


ARTICLE 


Model Experiment 
i Before OCT 
Before bias 
a F c After OCT 
mbLN4 After bias mbLN2 bLN1 bLN2 
20 15 
= z 
2 o 
e 8 
2 d a 
bi z 
-15' 
0 0.5 1 1.5 ) 10 20 30 0 05 1 18 0 12 18 
Time (s) mbLN-odour pair Time (s) bLN-odour pair 
e ohp Experiment 
= 
2 oe 20 2hp 104 thx 
3 che Ss Ofmm” 
c thx y S 
@ | 2hp_ 2hp wy & | 
Q |+OCT™ 2hp %s ® -10 
a thx & 
° che che 2 5 -2049§bLN LFP 
2 Fe che | | ctrl power 
© feoct™ thx 


Time (s) 


Figure 4 | Odour-specific decrease of mushroom body output by OCT. 

a, Simulation of bLN network as in Fig. 2e. Smoothed post-stimulus time 
histograms (PSTHs) for two mbLNs (ten trials; shading, s.e.). Red, responses at 
equilibrium; blue, responses after ten trials with STDP + OCT curve shown in 
Fig. 3f. b, Left: difference in PSTHs for all 30 mbLN responses before and after 
five trials of STDP + OCT (‘PSTH after pairing’ minus ‘PSTH before pairing’, 
thin lines). Responses of individual units at particular times can increase or 
decrease, but average difference (thick line) is less than zero at all times. Right: 
time-averaged differences for each of the 30 mbLNs (‘PSTH after pairing’ 
minus ‘PSTH before pairing’). In some instances, mbLN responses increased 
(for most units this occurred transiently; for a few mbLNs the entire time- 
averaged response increased). This is due to reduced lateral inhibition (that is, 
disinhibition) because mbLN population activity decreased. c. In vivo 
intradendritic recordings of bLN odour responses before and after pairing of 
the odour with localized OCT injection in the B-lobe (50-ms pressure-pulse, 
800 ms after odour onset, either five (n = 5) or ten pairing trials (n = 13)). 
PSTHs for two bLNs (ten trials; shading, s.e.): red, responses before odour- 
OCT pairing; blue, responses after ten trials of odour paired with OCT 
injection. d, Same difference of PSTHs as in b, for 18 recorded bLN-odour pairs 
(12 bLNs). As in the model, responses of individual bLNs can increase or 


relative to the conditioned (odour) stimulus. This provides experi- 
mental support for related theoretical propositions”**°*". Our results 
also solve the temporal dissociation puzzle revealed in experiments". 

Because the selectivity of synaptic change allowed by STDP relies 
on the rare co-occurrence of pre- and postsynaptic spikes, it is bene- 
ficial that Kenyon cell firing is rare and that odour representations by 
Kenyon cells are sparse, as observed’*"'*. We propose that olfactory 
associative memories are stored, at least in part, as sparse sets of 
synaptic weights between Kenyon cells and mushroom body output 
neurons. In the absence of reinforcer, STDP and lateral inhibition 
between bLNs set the (default) instantaneous output of the B-lobe 
network near the middle of its dynamic range. This serves at least 
two purposes. First, it maximizes the number of possible distinct com- 
binations of simultaneously active bLNs. Second, it ensures that the 
output of the network can be increased or decreased, if an associative 
reinforcer is to co-occur. It is possible that some of the molecular 
pathways revealed by Drosophila neurogenetics®*"' have a role in the 
phenomenon described here. The functional form of STDP has 
recently been shown to depend on neuromodulatory context in mam- 
malian brain slices****’, suggesting that our results may have broad 
implications for STDP and learning. 


METHODS SUMMARY 


All results were obtained in vivo from locusts housed in a crowded colony. Odours 
were delivered by injection of a controlled volume of odorized air within a 


decrease transiently, but on average there is a net decrease. e, OCT-induced 
changes are odour specific. Three odours are sequentially paired (during five 
trials) with OCT to assess carry-over of pairing across odours. Order of stimuli 
is shown at left. Red colours, responses before pairing with OCT; blue colours, 
responses after pairing. Odours are presented in ten-trial blocks and PSTHs are 
shown for each block (shading, s.e.). Changes due to nonspecific pairing are 
minimal by comparison with those due to odour-specific pairing. f, Summary 
data comparing the effect of odour-OCT pairing (blue: n = 18 bLN-odour 
pairs, 12 bLNs) with those of controls (grey: n = 12 bLN-odour pairs, 9 bLNs; 
black (LFP controls): n = 18 recordings simultaneous with intracellular bLN 
recordings). Dark grey (m = 7), effect on bLN response to non-paired odour 
before and after pairing another odour with OCT. Light grey (n = 5), effect on 
bLN response to non-paired odour before and after pairing electrical activation 
of Kenyon cells with OCT. Black, effect on LFP power (10-30-Hz band, 
recorded in mushroom body) before and after pairing odour with OCT. 
Changes due to pairing odour with OCT differ significantly from zero 

(P < 0.0006, t-test) and from those in the three control conditions (P < 0.002- 
0.022, t-test). Changes in control conditions were not significantly different 
from zero (P > 0.72 (dark grey), 0.64 (light grey) and 0.33 (black)). 


constant stream of desiccated air. The results presented here are derived from 
intradendritic recordings, made using sharp micropipettes, of 76 neurons in 55 
locusts. LFPs were recorded with silicon probes (NeuroNexus) in the mushroom 
body Kenyon cell soma cluster. Octopamine (100 |1M) was injected locally into 
the mushroom body -lobe using a picopump (WPI). Electrical stimulation of 
Kenyon cell somata was done with modified tetrodes (FHC). Stimulation protocols 
and data analysis were carried out using specialized software (LABVIEW, National 
Instruments; IGOR, Wavemetrics; and MC_STIMULUS, Multichannel Systems). 
Simulations of networks of ‘Izhikevich units’** were done using MATLAB 
(Mathworks). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Preparation and stimuli. All results were obtained in vivo from locusts 
(Schistocerca americana) housed in an established, crowded colony. Young adults 
of either sex were immobilized in a holder. Both antennae were secured in place 
with respect to the olfactory delivery system and remained intact for olfactory 
stimulation. The brain was exposed, desheathed and superfused with locust 
saline, as previously described”. 

Odour delivery. Odours were diluted 10% v/v in paraffin oil. They were delivered 
by injection of a controlled volume of odorized air within a constant stream of 
desiccated air. Total airflow was set to 0.851min~’ and the odour was further 
diluted by one-third in air. Teflon tubing was used at and downstream of the 
mixing point to prevent odour lingering and cross-contamination. 
Intradendritic recordings. Sharp electrode recordings from the dendrites of 
bLNs were made with borosilicate glass micropipettes (d.c. resistance, 150- 
200 MQ) filled with 1M potassium acetate. The cell type from which the data 
are derived could be recognized by several characteristics: responses to odour, 
subthreshold baseline activity profile and response to electrical stimulation of 
Kenyon cells'’. Recordings from bLNs were always made from dendrites in the 
B-lobe. To assess changes in bLN instantaneous firing rate, bLN spike times were 
either binned per simultaneously recorded LFP cycle (Fig. 2f) or convolved with a 
50-ms Gaussian kernel to generate smoothed PSTHs (Fig. 4c, e; shading, s.e.). 
EPSP slope was measured as the maximum slope over the 10-90% range of 
maximum amplitude. The average EPSP waveforms shown at bottom in 
Fig. 3a—c were computed from the same data as the slope measurements shown 
at top in Fig. 3a—c. The electrical stimulation rate was 0.1 s~' (with no intertrial 
gap). The results in this work were obtained from 55 animals; in two instances the 
same animal was used for two distinct bLNs with OCT injection. 

LEP recordings. LFPs were recorded in the mushroom body Kenyon cell soma 
layer using Michigan probes (http://www.neuronexustech.com). 

Electrical stimulation. Twisted-wire tetrodes obtained from FHC (no. CE4B75) 
were modified for monopolar stimulation, with the casing serving as the anode. 
The tips of the tetrodes were splayed such that the distance between the exposed 
tips was approximately equal to 60% of the diameter of the mushroom body calyx. 
The exposed end of the stimulating electrode was embedded among Kenyon cell 
somata. The tetrodes were electroplated with gold solution to impedances of 200- 
350 kQ at 1 kHz. Stimulating currents (5-140 1A, 0.1 ms) were generated using an 
STG1000 Multichannel System (as in ref. 17). 

OCT injection. Patch pipettes were back-filled with a 100.44M OCT hydro- 
chloride, 0.01% Fast Green solution. A pneumatic picopump (WPI) was used 
to apply one 50-ms, 5-p.s.i. pressure pulse per trial. Each pulse injected ~5 pl (as 
measured by previous injection into a drop of oil). The injected solution was 
localized to the f-lobe, as verified by bounded diffusion of the Fast Green dye. 
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Simulations. All simulations were carried out with networks of Izhikevich 
units“. Networks consisted of 30 units each with dynamics given by 


dv 5 
a =0.04v" +5v+140—u+I 
du 
— =a(bv— 
di a(bv—u) 
where a = 0.02, b= 0.2, c 65, d= 8 and if v>30 then v—candu<u+d. 
Synaptic interactions and external inputs were introduced through the J term, 
which for the jth unit takes the form 


= + Won (i,f)forn @) + > wxclk,ffcc(k) 


Here wp.n is a matrix of inhibitory weights among network units, fin is the 
vector of spike times generated by the network, wxc is a matrix of excitatory input 
weights and fc is the vector of spike times that constitutes the input to the 
network. At every trial, fxc is drawn from a distribution characterized by three 
experimental observations: odour-evoked Kenyon cell spiking phase distribution 
relative to the LFP’’, average odour-evoked Kenyon cell population PSTH* and 
Kenyon cell baseline activity'’. The matrix wp.n is estimated from experimental 
recordings and kept fixed during the simulation. The matrix wxc is drawn from a 
normal distribution at the onset of the simulation, with variable means for dif- 
ferent initial conditions; it is updated in every trial according to 


= { Af _(w)K(At) 
~ Af) K(At) 


The temporal filter, K(At), is the experimentally derived STDP curve: either the 
unbiased Hebbian form as in ref. 17 or the depression-biased form that results 
from pairing with OCT as in Fig. 3f. The learning rate was set by the number of 
pairings carried out to arrive at the STDP curve in experiments. The weight 
dependences of the updating functions f,(w) and f_(w) are as follows: 


f+ (w)=-—wy" 
f-(w)=w 


As described in ref. 24, the value of 4 determines the shape of the equilibrium 
weight distribution. A value (4 = 0.08) was chosen such that the weight distri- 
bution in the simulation at equilibrium matched the experimental EPSP size 
distribution (Supplementary Information, section 3). As described above, para- 
meters in the model were constrained by experimental estimates, except for a, b,c 
and d, which characterize the dynamics of individual model units. These para- 
meters took the typical values used for regular spiking neurons“. 


ifAt <0 
ifAt>0 
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Mutations in kelch-like 3 and cullin 3 cause 
hypertension and electrolyte abnormalities 
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Hypertension affects one billion people and is a principal reversible 
risk factor for cardiovascular disease. Pseudohypoaldosteronism 
type II (PHAITI), a rare Mendelian syndrome featuring hypertension, 
hyperkalaemia and metabolic acidosis, has revealed previously 
unrecognized physiology orchestrating the balance between renal 
salt reabsorption and K* and H™ excretion’. Here we used exome 
sequencing to identify mutations in kelch-like 3 (KLHL3) or cullin 3 
(CUL3) in PHAII patients from 41 unrelated families. KLHL3 
mutations are either recessive or dominant, whereas CUL3 muta- 
tions are dominant and predominantly de novo. CUL3 and BTB- 
domain-containing kelch proteins such as KLHL3 are components 
of cullin-RING E3 ligase complexes that ubiquitinate substrates 
bound to kelch propeller domains” *. Dominant KLHL3 mutations 
are clustered in short segments within the kelch propeller and BTB 
domains implicated in substrate’ and cullin’ binding, respectively. 
Diverse CUL3 mutations all result in skipping of exon 9, producing 
an in-frame deletion. Because dominant KLHL3 and CUL3 muta- 
tions both phenocopy recessive loss-of-function KLHL3 mutations, 
they may abrogate ubiquitination of KLHL3 substrates. Disease 
features are reversed by thiazide diuretics, which inhibit the 
Na-Cl cotransporter in the distal nephron of the kidney; KLHL3 
and CUL3 are expressed in this location, suggesting a mechanistic 
link between KLHL3 and CUL3 mutations, increased Na-Cl 
reabsorption, and disease pathogenesis. These findings demonstrate 
the utility of exome sequencing in disease gene identification despite 
the combined complexities of locus heterogeneity, mixed models of 
transmission and frequent de novo mutation, and establish a fun- 
damental role for KLHL3 and CUL3 in blood pressure, K* and pH 
homeostasis. 


A small number of genes causing Mendelian forms of hypertension 
have been identified, establishing the role of increased renal salt 
reabsorption in its pathogenesis’””*. The study of PHAII has identified 
a physiological mechanism that orchestrates activities of diverse 
electrolyte flux pathways, allowing maximal salt reabsorption in res- 
ponse to aldosterone when angiotensin II (AII) is elevated, as in settings 
of reduced intravascular volume (hypovolaemia), versus maximal 
potassium secretion in settings of hyperkalaemia, in which aldosterone 
is elevated without changes in AII'. The role of WNK kinases in this 
process was revealed by discovery of their mutation in a small fraction 
of PHAII kindreds'’. Dominant gain-of-function mutations in WNK4 
or WNK1 lead to constitutively increased salt reabsorption in the distal 
nephron regardless of volume status, resulting in hypertension, and 
inhibition of K* secretion despite marked hyperkalaemia’!'*””. 

We studied a cohort of 52 PHAII kindreds, including 126 affected 
subjects with renal hyperkalaemia and otherwise normal renal function; 
hypertension and acidosis were present in 71% and 82%, respectively. 
There was wide variation in disease severity and age of clinical presenta- 
tion (Supplementary Figs 1 and 2). Mutations in WNK1 or WNK4 were 
present in only seven of these kindreds (13%). Those without WNK 
mutations had only 2.0 + 1.4 affected members, complicating mapping 
efforts. 

Exome sequencing of eleven unrelated PHAII index cases without 
WNK mutations was performed. Index cases and affected relatives (five 
trios and one quartet) were also subjected to genome-wide SNP genotyp- 
ing. Tabulation of high quality novel protein-altering variants revealed 
124 genes with three or more variants, 50 with four or more, and 23 
with five. Concurrent analysis of linkage among the multiplex families 
was used to prioritize loci harbouring variants that co-segregated with 
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disease; this identified 28 genes with novel protein-altering variants 
that co-segregated with disease in two or more multiplex families. This 
revealed KLHL3 as a strong candidate, with novel KLHL3 mutations 
comprising five alleles in three kindreds, all of which co-segregated 
with the trait. These include one kindred in which affected members 
are homozygous for a nonsense mutation (W470X), one in which 
affected members are compound heterozygotes for two missense 
mutations (F322C and S410L), and one segregating a heterozygous 
missense mutation (R528H). As a confirmation of significance, 
Fisher’s exact test was used to compare the prevalence of novel protein- 
altering variants in all genes in PHAII cases versus 699 control exomes. 
A single gene, KLHL3, showed a burden of mutation that surpassed 
genome-wide significance (P = 1.1 X 10°; Supplementary Tables 1-3). 


KLHL3 was sequenced in all PHAII index cases, identifying novel 
mutations in 24 (Fig. 1a, b and Supplementary Figs 3 and 4). Nearly all 
are at positions conserved among orthologues (Supplementary Fig. 5). 
Sixteen kindreds have heterozygous mutations that co-segregate with 
the trait under a dominant model (log of odds (lod) score 6.9, <—2 
under other models). In contrast, eight index cases inherited mutations 
in both KLHL3 alleles. In these kindreds, affected members are con- 
fined to siblings of index cases who inherited the same two mutations, 
whereas unaffected relatives inherited zero or one mutation (lod score 
4.3 for a recessive model, <—2 for other models). Recessive trans- 
mission has not been previously described for PHAII. Consistent 
with two modes of transmission, subjects with dominant KLHL3 
mutations had significantly higher serum K™ levels (6.2 + 0.6 mM) 
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Figure 1 | Recessive and dominant KLHL3 mutations in PHAII kindreds. 
a, b, Representative kindreds demonstrating recessive (a) and dominant 

(b) KLHL3 mutations (all 24 kindreds are shown in Supplementary Figs 3 and 
4). Affected, unaffected and phenotype-undetermined subjects are denoted by 
black, white and grey symbols, respectively. KLHL3 alleles are denoted by ‘+’ 
(wild type), ‘d’ (recessive mutation) and ‘D’ (dominant mutation). Sequence 
traces show wild-type (WT) and mutant (*) alleles and encoded amino acids. 
het., heterozygous; hom., homozygous. ¢, KLHL3 protein sequence. Coloured 
bars indicate BTB domain (lavender), BACK domain (peach) and kelch 
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propeller blades (B1-B6, grey) with B-strands ‘a-d’ in yellow, red, green and 
blue, respectively. Recessive (light blue) and dominant (pink) mutations are 
shown; recurrences are indicated by numbers. d, Kelch propeller schematic, 
from the KLHL2 crystal structure”. §-Strands are coloured as in c; dominant 
mutations are indicated. e, CRL schematic, comprising a BTB-containing kelch 
protein (KLHL3), CUL3 and a ubiquitin (Ub)-transfer-mediating RING 
protein, with substrate bound via the kelch propeller. The complex is shown as a 
dimer’. C term., C terminus; N term., N terminus. 
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than heterozgyotes for recessive mutations (4.8 + 0.6 mM) (P< 10-4, 


Student’s t-test; normal range 3.5-5.0 mM). These findings establish 
that PHAII can be caused by either recessive or dominant KLHL3 
mutations. Importantly, we infer that mutations in dominant kindreds 
are probably dominant-negative, because they phenocopy the features 
of recessive disease. 

KLHL3 contains an amino-terminal BTB domain, a BACK domain, 
and carboxy-terminal kelch-like repeats that form a six-bladed 
B-propeller structure**” (Fig. 1c-e). There are over 50 BTB-containing 
kelch (BTB-kelch) genes in humans’; their propeller domains bind 
substrate proteins, promoting substrate ubiquitination via interaction 
of the BTB domain with CUL3, a component of a cullin-RING E3 
ubiquitin ligase (CRL)*°**. Ubiquitination serves diverse functions, 
including targeting proteins for proteasomal degradation as well as 
non-degradative roles such as modulation of protein activity, inter- 
action and localization’*. 

Whereas recessive KLHL3 mutations are distributed throughout the 
encoded protein, dominant KLHL3 mutations show marked clustering 
(Fig. 1c). Nine of sixteen dominant mutations alter one of the last four 
amino acids of the six ‘d-a’ loops that connect the outermost (‘d’) 
B-strand of one kelch propeller blade to the innermost (‘a’) B-strand 
of the next blade. Two others are in “b-c’ loops. These dominant PHAII 
mutations lie near the hub of the propeller (Fig. 1d) at or near sites 
implicated in substrate binding in paralogues’ (Supplementary Fig. 5). 
Three other dominant mutations cluster within the BTB domain, at or 
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near sites implicated in cullin binding in paralogues*. We infer that 
dominant mutations in KLHL3 probably impair binding either to 
specific substrates or to CUL3. 

After accounting for KLHL3, WNK1 and WNK4 mutations, 21 
PHAII kindreds without mutations remained. We considered the pre- 
sumed functional partner of KLHL3, CUL3 (Fig. le), as a potential 
candidate. Among PHAII exomes, novel heterozygous CUL3 variants 
were suggested in two. Sequencing CUL3 in all index cases identified 
17 with novel heterozygous mutations, all in cases without KLHL3, 
WNKI1 or WNK4 mutations (Fig. 2a and Supplementary Fig. 6). Eight 
of these mutations were documented to be de novo, providing over- 
whelming evidence that these mutations are disease-causing. CUL3 
mutations all cluster in sites implicated in splicing of exon 9, including 
the intron 8 splice acceptor (m = 4), the intron 9 splice donor (n = 5), 
the putative intron 8 splice branch site (n = 5), and a putative splice 
enhancer in exon 9 (n = 3, within a TTGGA(T/A)) splice enhancer 
consensus sequence”*) (Fig. 2b). 

To test the impact of these mutations on splicing, CUL3 genomic 
DNA spanning exon 8 to exon 10 containing either wild-type sequence 
or one of nine PHAII mutations was cloned and expressed in HEK293 
cells, and the spliced RNA products were analysed. Whereas the wild- 
type sequence produces a properly spliced product containing all three 
exons, each of the mutants produces a predominant product that skips 
exon 9, joining exon 8 to exon 10 (Fig. 2c, d). This results in an in-frame 
57 amino acid deletion (residues 403-459) in the segment linking the 
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Figure 2 | Dominant CUL3 mutations in PHAII kindreds cause skipping of 
exon 9. a, Representative kindreds demonstrating CUL3 mutations, depicted 
as in Fig. 1 (all 17 kindreds are shown in Supplementary Fig. 6). int8, intron 8; 
ex9, exon 9; int9, intron 9. Positions are numbered relative to splice sites and the 
first base of the exonic splice (ES) enhancer. b, CUL3 mutation locations. 
Consensus splicing sequences'*”* and corresponding wild-type CUL3 
sequences are shown; invariant bases (green) and consensus homology (yellow) 
are indicated. Mutations are shown in red; recurrences are indicated by 


\ 


Exon 9 skipped 

numbers. Ins.G, insertion of a G (guanine). ¢, Reverse transcription- 
polymerase chain reaction (RT-PCR) of spliced RNA. Wild-type CUL3 
constructs produce a single product including exons 8, 9 and 10 (844 bp); all 
nine mutants tested produce a predominant product that skips exon 9 (673 bp). 
d, Representative RT-PCR sequences. The wild-type construct produces 
complementary DNA with properly spliced junctions between exons 8-9 (top) 
and 9-10 (middle), whereas the mutant construct (splice donor g(+1)c) 
produces cDNA joining exon 8 to exon 10 (bottom). 
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Figure 3 | KLHL3 expression in the kidney. a-f, Mouse kidney sections 
stained with antibodies to KLHL3 (red), TRPM6 (a marker of the DCT”, green) 
and AQP2 (a marker of the collecting duct*®, blue). Scale bars, 25 jim. 

a-c, Staining for KLHL3 (a), TRPM6 (b) and the merged image 


BTB-binding and RING-binding domains of CUL3. The fact that 
CUL3 mutations phenocopy recessive KLHL3 mutations suggests that 
they abrogate CUL3 function at KLHL3 targets. 

As with PHAII caused by WNK1 and WNK4 mutations’, virtually 
all patients with KLHL3 and CUL3 mutations have been treated with 
thiazide diuretics, inhibitors of the Na—Cl cotransporter (NCC), result- 
ing in correction of phenotypic abnormalities. WNK4 regulates the 
activities of NCC'*’*"®, the epithelial Na‘ channel ENaC”, and the K* 
channel ROMK", and is co-expressed with these proteins in the renal 
distal convoluted tubule (DCT) and collecting duct'?*”°. Staining 
mouse kidney sections with specific antibodies demonstrated that 
KLHL3 is predominantly expressed in the DCT and collecting duct, 
with apical localization in the DCT (Fig. 3). CUL3 is ubiquitously 
expressed and is in all nephron segments, with particularly high expres- 
sion in the proximal tubule, but also in the DCT and collecting duct 
(Supplementary Fig. 7). These findings are consistent with both proteins 
having a role in the regulation of salt and electrolyte homeostasis in the 
distal nephron. 

Thereare highly significant differences in phenotypic severity among 
PHAII patients with mutations in different genes (Table 1 and Sup- 
plementary Figs 1, 2 and Supplementary Table 4). Subjects with CUL3 
mutations presented at much younger ages than those with mutation in 
KLHL3, WNK1 or WNK4, had significantly more severe hyperkalaemia 
and metabolic acidosis, and were far more likely to have hypertension 
before age 18 (others commonly develop hypertension at later ages). 
The majority of subjects with CUL3 mutations demonstrated failure 
to thrive or growth impairment. These observations, in conjunction 
with the high rate of CUL3 de novo mutation, support impairment of 
reproductive fitness. Among the other mutant loci, there remain sig- 
nificant differences in disease severity (rank order recessive KLHL3 > 
dominant KLHL3 > WNK4 > WNK1). 


Table 1 | PHAII phenotypes, stratified by genotype. 


(c) demonstrates KLHL3 expression in the DCT with apical localization 
(arrowheads). d-f, Staining for KLHL3 (d), AQP2 (e) and the merged image 
(f) demonstrates KLHL3 expression in the collecting duct. 


KLHL3 and CUL3 mutations account for 79% of kindreds in our 
cohort. Gene identification was complicated by the combined effects of 
locus heterogeneity, two modes of transmission at one locus, and few 
informative meioses. Many so far unsolved Mendelian traits may have 
similar complexities. Use of control exomes as comparators for ana- 
lysis of mutation burden may be broadly applicable to discovery of 
such loci. 

The most parsimonious mechanism of KLHL3 and CUL3 mutations 
is that they abrogate ubiquitination of targets normally bound by 
KLHL3, activity that is required for normal modulation of renal salt, 
K* and H* handling in response to physiological challenge; this specu- 
lation will require biochemical verification. The fact that recessive 
mutations in KLHL3 cause PHAII without other diverse effects implies 
either that KLHL3 targets are highly restricted to the renal salt and 
electrolyte pathway, or that loss of KLHL3 function at other targets can 
be compensated by other loci. BT'B-kelch-CUL3 CRLs can act as 
dimers, with two substrate-binding domains capable of engaging the 
same target molecule’. This suggests a potential mechanism to explain 
dominant-negative effects of KLHL3 and CUL3 mutations. 

CUL3-based CRLs participate in a wide range of critical cellular 
processes® via binding diverse BTB-domain-containing proteins*>. 
CUL3 mutations affecting all or many of these activities would 
undoubtedly produce very broad phenotypes. CUL3 mutations in 
PHAII merely phenocopy the effects of loss of KLHL3, suggesting that 
they selectively abrogate function at KLHL3 targets. The stereotypic 
consequences of CUL3 mutations, all deleting 57 amino acids in a 
region linking the BTB-binding and RING-binding domains, support 
such a specific effect. Consistent with this possibility, introduction of a 
flexible linker sequence to this region of CUL1 leaves substrate protein 
binding and ubiquitin polymerization intact, but nonetheless abolishes 
ubiquitination of a normal substrate”’. 


Mutant gene No. of kindreds No. of affecteds Dx/Ref age* K* (mM) HCO3 (mM) Hypertension at 
(nl 3.5-5.0 mM)+ (nl 22-28 mM)t <age 18 (%)+ 

CUL3 7 21 9+6 7.5+09 5.5: 2:0 94 

KLHL3 recessive 8 14 26+14 6.8+0.5 716%. 1:5) 14 

KLHL3 dominant 6 40 24+18 6.2+0.6 T1225 17 

WNK4 5 15 28+18 6.4+0.7 20.8+2.3 10 

WNK1 2 23 36+ 20 5.8+08 224446 13 
Dx/Ref age, age at diagnosis or referral. K*, serum potassium; HCO3_, serum bicarbonate; nl, normal range. Hypertension at =age 18 (%), percentage of affected patients diagnosed with hypertension by age 18. 
Values for Dx/Ref age, K* and HCO3_ are means + standard deviations. Significance of differences among genotype classes was calculated by ANOVA (Dx/Ref age, K*, HCO3_) or Fisher’s exact test (hypertension). 


*P=0.0002. 
+P<0.0001. 
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Thiazide diuretics correct abnormalities in virtually all PHAII subjects; 
similar correction is seen in a mouse model of PHAII with either 
thiazides or genetic ablation of NCC'°. These findings suggest that 
increased NCC activity is likely to be a common pathogenic mech- 
anism. Co-expression of KLHL3 and CUL3 with NCC in the DCT, and 
evidence that NCC is ubiquitinated** is consistent with this notion. 
ROMK and the H* ATPase are respectively required for net renal K* 
secretion’’ and H™ secretion™ and are also likely targets, although their 
activity is expected to be decreased, rather than increased, in PHAII. 
Another thiazide-sensitive Na—Cl cotransport pathway in the collect- 
ing duct has recently been described, suggesting an additional potential 
target”. Whether a KLHL3-CUL3 CRL acts directly or indirectly on 
these targets, whether they alter delivery of NCC and other targets to, 
or retrieval from, the plasma membrane, and what upstream pathways 
regulate this activity is unknown. Similarly, whether KLHL3-CUL3 
and WNKs operate within the same or different pathways is presently 
unknown, it is of interest that segments of NRF2 that interact with the 
kelch propeller domain of KEAP1 are highly acidic’®, akin to the 
domain of WNK4 that is mutated in PHAII"’. 

These findings demonstrate previously unrecognized roles for 
KLHL3 and CUL3. Understanding the upstream regulators and 
downstream targets of KLHL3-CUL3 activity will provide further 
insight into the mechanisms underlying maintenance of blood pres- 
sure and electrolyte homeostasis in response to diverse environmental 
challenges. 


METHODS SUMMARY 


A cohort of 52 PHAII kindreds comprising 126 affected subjects was ascertained, 
characterized and recruited for study. Index cases of eleven kindreds were chosen 
for whole exome capture and sequencing, and novel variants were identified. 
Genes were prioritized for follow-up and subjected to Sanger sequencing of index 
cases of each kindred. Segregation of rare variants within kindreds was analysed. 
KLHL3 and CUL3 were localized in kidney by staining with specific antibodies and 
immunofluorescence microscopy. Effects of CUL3 mutations on RNA splicing 
were studied by analysis of spliced products produced in mammalian cells. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Study subjects. Index cases were referred for PHAII. Patients and participating 
family members provided consent to a study protocol approved by the Yale 
Human Investigation Committee. Control exomes were 699 unrelated subjects 
of European ancestry without hypertension, sequenced as part of diverse gene 
discovery projects. Genomic DNA was isolated from venous blood using standard 
methods. 

Exome capture, sequencing and variant calling. Genomic DNA from eleven 
PHAII index cases and 699 controls was captured on NimbleGen 2.1M human 
exome arrays (Roche) and sequenced on the Illumina GenomeAnalyzer as previ- 
ously described*’. Reads were mapped to the reference genome (hg18) using Maq** 
and genotypes of targeted bases were called with SAMtools”. Variants found in 
dbSNP v.130 or 1000 Genomes databases were excluded from further analysis. 
Remaining variants were considered ‘novel’ and annotated for impact on the 
encoded protein, conservation and expression’’. Aligned reads were viewed with 
the Integrative Genomics Viewer™*. Among PHAII cases, 94.2% of targeted bases 
were read by eight or more independent reads; sensitivity and specificity of 
heterozygous calls were estimated at 93.7% and 99.9% by comparison to 
Illumina SNP genotyping. Among controls, 94.4% of targeted bases were read 
by eight or more independent reads; sensitivity and specificity of heterozygous 
calls were estimated at 94.5% and 99.8%. Sanger sequencing of 212 novel variants 
from controls with a SAMtools quality score of =100 demonstrated validation in 
211 and amplification failure in 1, supporting high specificity of variant calls. 
SNP genotyping and linkage analysis. For the 11 PHAII index cases and their 
affected relatives (five trios and one quartet) genome-wide SNP genotyping was 
performed using Illumina Human610-Quad BeadChips and GenomeStudio soft- 
ware. Approximately 40K tag SNPs were extracted using Plink*’. Analysis of 
linkage was performed using Merlin’, specifying an autosomal dominant model 
with no phenocopies. Variants from exome sequencing in regions of the genome 
that were excluded (lod score <—2) were removed from further analysis, while 
those that supported linkage were prioritized for further evaluation. In kindreds 
showing potential recessive transmission of PHAII, SNP genotypes were examined 
for regions of homozygosity, and linkage was performed specifying an autosomal 
recessive model. 

Sanger sequencing of KLHL3 and CUL3. PCR amplification and Sanger sequen- 
cing from genomic DNA was performed using standard methods. Primers were 
designed with Primer3*’. Variants identified by exome sequencing were verified. 
All exons and flanking intronic sequences of KLHL3 and CUL3 were sequenced 
from all PHAII index cases. Previously unidentified mutations were discovered 
and verified by independent amplification and sequencing. Co-segregation of 
mutations with disease was determined by sequencing in all available kindred 
members. CUL3 exon 9 and its flanking intronic sequence was sequenced in 
150 unaffected unrelated controls, none of whom were found to harbour previ- 
ously unidentified variants. It is noteworthy that because of lower or absent 
sequence coverage at or near intron-exon junctions, splice donor and acceptor 
mutations in CUL3 were suggested in two of the eleven PHAII exomes (SAMtools 
quality scores 96 and 75) but three branch site mutations were outside the exome 
sequence and one splice enhancer mutation was poorly covered (SAMtools quality 
score 3). 

Genome-wide assessment of mutation burden. Genes show substantial vari- 
ation in the prevalence of novel or rare protein-altering variants for biological 
reasons, including differences in gene size and variation in the proportion of bases 
that are under purifying selection, and for technical reasons, including difficulties 
in accurately mapping short sequence reads among closely related paralogues. 
These factors can limit the ability to directly identify disease loci by simply count- 
ing and ranking genes according to the absolute number of such variants, particu- 
larly for diseases with substantial locus or model heterogeneity. This gene-to-gene 
variation can be accounted for by use of control exome data. The prevalence of rare 
variation in each gene in case exomes was compared to the corresponding pre- 
valence in a large set of control exomes with a Fisher’s exact test. Variants included 
in the analysis were protein-altering (missense, nonsense and splice site muta- 
tions) and high quality (=8 independent reads and SAMtools quality score =100). 


For a gene-wise test of rare variant burden in a genome with ~21,000 genes, 
correction for multiple testing suggests a threshold P value of ~2.4 X 10°, 
anticipated to produce a false discovery rate (FDR) of one gene per twenty experi- 
ments. The FDR of the Fisher’s test was evaluated by Monte Carlo simulation, 
which confirmed an FDR of <1 gene per 20 experiments (Supplementary Table 1). 
The power to identify trait-related loci was estimated as a function of the number 
of variants detected in cases and the number of case exomes sequenced 
(Supplementary Table 2), and the test was applied comparing the eleven PHAII 
and 699 control exomes (Supplementary Table 3). 

Orthologue and paralogue comparisons. Protein sequences of orthologues and 
paralogues were aligned with Clustal W*. Crystal structures were examined with 
Cn3D*. The locations of KLHL3 propeller mutations were compared to the crystal 
structure of human KLHL2 (PDB accession 2XN4)”, the closest human paralogue* 
(85% amino acid identity in the propeller). The location of the peptide encoded by 
CUL3 exon 9 was approximated by comparison to the crystal structure of human 
CULI (PDB accessions 1LDJ and 1LDK)”””. 

Splicing assay. A 3,782 bp segment of CUL3, extending from 287 bp proximal to 
exon 8 to 327 bp distal to exon 10, was amplified by PCR (Advantage 2 polymerase, 
Clontech) from genomic DNA of nine PHAII patients with different CUL3 muta- 
tions and one subject with wild-type CUL3 sequence. Products were cloned into the 
pcDNA6.2/GW/D-TOPO mammalian expression vector (Invitrogen), and plasmids 
were purified (QIAprep, Qiagen) and sequenced. HEK293 cells were transfected 
independently with each plasmid using Lipofectamine 2000 (Invitrogen) and 
harvested ~24h post-transfection. RNA was isolated using RNeasy with DNase 
on-column digestion (Qiagen). The spliced expression products were assessed by 
reverse transcription with oligo(dT) priming (Superscript III RT, Invitrogen) 
followed by PCR with vector-specific and CUL3-specific primers. Products were 
fractionated and visualized via agarose gel electrophoresis, and sequenced. 
Untransfected and water controls were negative. 

Immunofluorescence. Fresh frozen mouse kidney sections were fixed with ethanol 
at 4°C for 30min and acetone at —20°C for 3 min, washed with 1x PBS, and 
permeabilized with 0.1% Triton X-100 (Sigma) at room temperature (22 °C) for 
10min. Sections were blocked with 10% donkey serum and 1% bovine serum 
albumin at room temperature for 1h, incubated with primary antibodies at room 
temperature for 1h or 4°C overnight, washed four times with 1X PBS, incubated 
with secondary antibody at room temperature for 1 h, and washed four times with 
1X PBS, with DAPI nuclear counterstain in the second wash. Slides were mounted 
with Mowiol (Polysciences) and 1% n-propyl gallate (Sigma) as an anti-fade agent. 
Primary antibodies included 1:100 rabbit anti-KLHL3, 1:50 rabbit anti-CUL3, 1:800 
or 1:1,200 guinea pig anti- TRPM6 (ab66655, ab1871, and ab47017; Abcam) and 
1:400 or 1:800 goat anti-AQP2 (sc-9882, Santa Cruz Biotechnology). Secondary 
antibodies, diluted 1:800, included donkey Cy3 anti-rabbit, Cy2 anti-guinea pig and 
649 anti-goat IgG (AffiniPure, Jackson Immunoresearch). Staining with secondary 
antibodies only was consistently negative. 
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Probing sporadic and familial Alzheimer’s disease 
using induced pluripotent stem cells 
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Our understanding of Alzheimer’s disease pathogenesis is currently 
limited by difficulties in obtaining live neurons from patients and the 
inability to model the sporadic form of the disease. It may be possible 
to overcome these challenges by reprogramming primary cells from 
patients into induced pluripotent stem cells (iPSCs). Here we repro- 
grammed primary fibroblasts from two patients with familial 
Alzheimer’s disease, both caused by a duplication of the amyloid-B 
precursor protein gene’ (APP; termed APP”?), two with sporadic 
Alzheimer’s disease (termed sAD1, sAD2) and two non-demented 
control individuals into iPSC lines. Neurons from differentiated 
cultures were purified with fluorescence-activated cell sorting and 
characterized. Purified cultures contained more than 90% neurons, 
clustered with fetal brain messenger RNA samples by microarray 
criteria, and could form functional synaptic contacts. Virtually all 
cells exhibited normal electrophysiological activity. Relative to 
controls, iPSC-derived, purified neurons from the two APpPP 
patients and patient sAD2 exhibited significantly higher levels of 
the pathological markers amyloid-B(1-40), phospho-tau(Thr 231) 
and active glycogen synthase kinase-3B (aGSK-3B). Neurons from 
APP”? and sAD2 patients also accumulated large RAB5-positive 
early endosomes compared to controls. Treatment of purified 
neurons with f-secretase inhibitors, but not y-secretase inhibitors, 
caused significant reductions in phospho-Tau(Thr 231) and aGSK- 
3B levels. These results suggest a direct relationship between APP 
proteolytic processing, but not amyloid-B, in GSK-36 activation and 
tau phosphorylation in human neurons. Additionally, we observed 
that neurons with the genome of one sAD patient exhibited the 
phenotypes seen in familial Alzheimer’s disease samples. More 
generally, we demonstrate that iPSC technology can be used to 
observe phenotypes relevant to Alzheimer’s disease, even though it 
can take decades for overt disease to manifest in patients. 
Alzheimer’s disease is a common neurodegenerative disorder, 
defined post mortem by the increased presence of amyloid plaques 
and neurofibrillary tangles in the brain’. Amyloid plaques are extracel- 
lular deposits consisting primarily of amyloid-B peptides, and neuro- 
fibrillary tangles are intraneuronal aggregations of hyperphosphorylated 
tau, a microtubule-associated protein involved in microtubule stabiliza- 
tion*. The causative relationship between amyloid plaque/amyloid-B 
and tau pathologies is unclear in humans. Although the vast majority 
of Alzheimer’s disease is apparently sporadic with significant non- 
Mendelian genetic contributions’, analyses of cellular and animal 
models of rare, dominantly inherited familial forms of Alzheimer’s 
disease have driven most ideas about disease mechanisms. These rare 
cases have mutations or a duplication of APP, which encodes the 
amyloid-B precursor protein, or mutations in the presenilin genes, 
which encode proteolytic enzymes that cleave APP into amyloid-f 


5,10 


and other fragments. Mouse models that overexpress familial 
Alzheimer’s disease mutations develop extensive plaque deposition 
and amyloid-associated pathology, but neurofibrillary tangles and sig- 
nificant neuronal loss are conspicuously absent*°. Fetal human cortical 
cultures have also been used to study the APP-tau relationship. For 
example, cortical cultures treated with 20 1M amyloid-f have elevated 
phosphorylated tau (p-tau)’. However, it is still unclear whether 
physiologically relevant levels of amyloid-f directly cause elevated 
p-tau and which kinases are directly involved in this aberrant phos- 
phorylation. Additionally, experimental approaches using fetal human 
neurons are hindered by limited availability of samples and unknown 
genetic backgrounds. The recent developments in iPSCs and induced 
neurons have allowed investigation of phenotypes of neurological 
diseases in vitro**’°. However, not all diseases have been successfully 
modelled using iPSCs!', and it is unclear whether iPSCs can be used to 
study sporadic forms of disease. 

Here we report the derivation and neuronal differentiation of iPSCs 
from patients with familial and sporadic Alzheimer’s disease, as well as 
from non-demented, age-matched controls. Using purified human 
neurons we probe three key questions concerning Alzheimer’s disease: 
(1) can iPSC technology be used to observe phenotypes of patients with 
Alzheimer’s disease, even though it can take decades for overt disease to 
manifest; (2) is there a causative relationship between APP processing 
and tau phosphorylation; and (3) can neurons with the genome of a 
sAD patient exhibit phenotypes seen in familial Alzheimer’s disease 
samples? Supplementary Fig. 1 summarizes the experimental approach 
and findings. 

We characterized APP metabolism in fibroblasts before reprogram- 
ming to iPSCs (Supplementary Fig. 2). APP expression and amyloid-f 
secretion were quantified in early-passage primary fibroblasts from two 
non-demented control (NDC) individuals, two sAD patients and two 
APP”? patients (Table 1). The presence of the genomic duplication 
was confirmed in fibroblasts. Relative to NDC and sAD cells, APpPP 
fibroblasts expressed higher levels of APP mRNA and secreted 1.5- to 
twofold higher amounts of amyloid-B(1-40) peptides into culture 
media compared to NDC cells. We did not detect significant increases 
in amyloid-B(1-42/1-40) or amyloid-$(1-38/1-40) in patient samples 
versus controls. 

We generated iPSC lines by transducing fibroblasts with retro- 
viruses encoding OCT4, SOX2, KLF4, c-MYC and, in one-third of 
cultures, EGFP. Each of the six individuals was represented by three 
clonal iPSC lines. All 18 iPSC lines maintained embryonic stem (ES)- 
cell-like morphology, expressed the pluripotency-associated proteins 
NANOG and TRA1-81, maintained euploid karyotypes, expressed 
endogenous locus-derived SOX2, repressed retroviral transgenes, 
and could differentiate into cells of ectodermal, mesodermal and 
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Table 1 | Summary of patient information 


Code Diagnosis Gender Family history Age at onset Age at biopsy MMSE at biopsy APOE 
NDC1 Non-demented control M Possible N/A 86 30 2-3 
NDC2 Non-demented control M N N/A 86 30 3-3 
sAD1 Sporadic AD F N 78 83 4 3-3 
sAD2 Sporadic AD M N 78 83 18 3-3 
APP>P1 Familial AD, APP duplication M Y 46 51 21 3-3 
APP?P2 Familial AD, APP duplication F ¥ 53 60 i? 3-3 


MMSE, mini mental state examination (perfect score = 30). AD, Alzheimer’s disease. 


endodermal lineages in vitro (Fig. la-d, Supplementary Figs 3a-e 
and 4). All lines tested (one per individual) formed teratomas when 
injected into nude rats (Supplementary Fig. 5). Supplementary Table 1 
provides details of each iPSC line. 

Variability in differentiation efficiency exists between pluripotent cell 
lines. To analyse variability in our iPSC lines, we used a fluorescence- 
activated cell sorting (FACS)-based method of neuronal differentiation 
and purification (summarized in Supplementary Fig. 6), based on work 
described previously”’. Briefly, the 18 iPSC lines were first differentiated 
into cultures containing neural rosettes (Supplementary Fig. 3f). From these 
cultures, neural progenitor cells (NPCs) were purified and the efficiency 
of NPC formation was assessed by CD184*CD15*CD44 CD271— 
immunoreactivity. These FACS-purified NPCs maintained expression 
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Figure 1 | Generation of iPSC lines and purified neurons from APP”?, sAD 
and NDC fibroblasts. a, b, iPSC lines express NANOG and TRA1-81. 

c, d, iPSC-derived, FACS-purified NPCs express SOX2 and nestin. e-h, iPSC- 
derived, FACS-purified neurons express MAP2 and BIlII-tubulin. Scale bars in 
a-h, 50 tm. i, Representative action potentials in response to somatic current 
injections. Data from iPSC line APP”?2.2. j, Spontaneous synaptic activity was 
detected (voltage clamp recording at the reversal potential of sodium (0 mV)) 
and reversibly blocked by GABA, receptor antagonist $R95531 (10 1M). Each 
panel represents ~4 min continuous recordings separated in 25 sweeps (grey 
traces) and superimposed for clarity. Black traces represent a single sweep. Data 
from iPSC line NDC2.1.k, 1, No significant difference was seen between NDCs 
and any patient’s cultures in the ability of iPSCs to generate NPCs at day 11 
(P = 0.08, n = 9), or the ability of NPCs to form neurons at 3 weeks (P = 0.82, 
n= 9). Error bars indicate s.e.m. 
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of NPC-associated markers, such as SOX2 and nestin, over multiple 
passages (Fig. 1c, d). NPCs were differentiated for 3 weeks into hetero- 
geneous cultures containing neurons (Supplementary Fig. 3g, h). APP 
copy number was faithfully maintained in differentiated cultures 
(Supplementary Fig. 3i). From these cultures, neurons were purified 
to near homogeneity, and the efficiency of neuron generation was 
assessed by CD24*CD184 CD44” immunoreactivity. No significant 
differences between any of the individuals in the efficiency of NPC or 
neuronal differentiation were detected (Fig. 1k, 1). 

Although we observed variability in differentiation among lines 
from each individual, the extent of inter-individual variation was less 
than observed intra-individual variability. These results suggest that 
any observed biochemical aberrations in neurons, if present in 
multiple lines derived from the same patient, are probably caused by 
features of that patient’s genotype. Purified neurons were plated at a 
density of 2 X 10° cells per well of a 96-well plate and cultured for an 
additional 5 days. More than 90% of cells in these cultures were 
neurons, as judged by the presence of BIII-tubulin*, MAP2* projec- 
tions (Fig. le-h). Genome-wide mRNA expression profiles of five 
representative purified neuronal cultures were compared to the par- 
ental iPSC lines and samples from fetal brain, heart, liver and lung 
(Supplementary Fig. 7 and Supplementary Table 2). Unsupervised 
hierarchical clustering analysis revealed that purified neurons most 
closely resembled fetal brain samples, in part due to a global upregulation 
of neuronal genes. Interestingly, the largest difference between fetal 
brain samples and purified neurons was downregulation in purified 
neurons of the hippo signalling cascade (~6.1 fold), which regulates 
proliferation of cells such as NPCs and glia’*™*. 

We determined multiple electrophysiological properties of purified 
neurons to assess passive membrane properties and synaptic connec- 
tivity (Fig. li, j, Supplementary Table 3 and Supplementary Fig. 8). 
Notably, virtually all neurons tested generated voltage-dependent 
action potentials and currents (Fig. li), which were blocked by 
tetrodotoxin (Supplementary Fig. 8). Transient bath application of 
ionotropic receptor agonists (25 uM muscimol or 101M AMPA) 
evoked transient currents, showing that purified neurons expressed 
functional GABA and AMPA receptors, respectively (Supplementary 
Table 3). To determine whether neurons were also able to form func- 
tional synaptic contacts, we analysed continuous whole-cell voltage 
clamp recordings. We detected spontaneous inhibitory and/or excitatory 
synaptic currents in a subset of cells (~40%). Analysis of the kinetics of 
those events combined with reversible blockade using GABA, or AMPA 
receptor antagonists demonstrated that the neurons not only fire action 
potentials but also made functional synaptic contacts (Supplementary 
Table 3). The electrophysiological results were supported by analysis 
of expression of protein markers of glutamatergic and GABAergic 
neuronal subtypes (VGluT1 and GABA, respectively), which were 
detected by immunofluorescence, with approximately 15% of cells 
staining brightly for VGluT1 and 8% for GABA, and most remaining 
neurons staining dimly for one of the markers (Supplementary Fig. 9a). 
RNAs indicative of glutamatergic, GABAergic and cholinergic sub- 
types (that is, VGLUT1, GAD67 and CHAT, respectively) were detected 
by quantitative polymerase chain reaction (qPCR). Importantly, no 
significant differences in neuronal subtypes were detected between 
patients and controls (Supplementary Fig. 9b-f). 

Elevated or altered secretion of amyloid-B peptides by fibroblasts 
is a feature common to all familial Alzheimer’s disease mutations 
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identified so far’*’’. It is not known if iPSC-derived neurons from 
familial Alzheimer’s disease patients maintain the elevated amyloid-B 
production seen in the parental fibroblasts. In sAD fibroblasts and other 
peripheral cells, APP expression and amyloid-f secretion are not con- 
sistently altered’”. To determine if iPSC-derived neurons from APP”? 
and sAD patients exhibit elevated amyloid-f secretion, amyloid-f levels 
in neuron-conditioned media were measured and normalized to total 
protein levels of cell lysates. Purified neurons from patients APP”? 1 and 
APP”?2, each represented by three independently derived iPSC lines, 
secreted significantly higher levels of amyloid-B(1-40) compared to 
mean NDC levels (Fig. 2a). Neurons from patient sAD2 also had sig- 
nificantly higher amyloid-B(1-40) levels compared to NDC neurons, 
even though no difference was observed between the fibroblasts of 
sAD2 and NDC individuals. We found that amyloid-B(1-42) and 
amyloid-$(1-38) levels in these purified neuronal cultures were often 
below the detection range of our assay, owing to the relatively small 
number of neurons purified. By cell type, neurons exhibited a larger 
difference in amyloid-f levels between APP”? and NDC than fibro- 
blasts, further suggesting that fibroblasts are not fully predictive of 
neuronal phenotypes (Fig. 2b). 
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Figure 2 | Increased amyloid-f, p-tau and aGSK-3B in sAD2 and APP”? 
neuronal cultures. a, Purified neurons from sAD2, APP?P1 and APPPP2 
secrete increased amyloid-B(1-40) (AB(1-40)) compared to NDC samples 

(P = 0.0012, 0.0014 and <0.0001, respectively). b, Amyloid-B differences 
between patients and controls are larger in neurons versus fibroblasts. Data sets 
are relative to NDC mean.c, d, Neurons from sAD2, APP??1 and APP”P2 have 
increased aGSK-3f (percentage non-phospho-Ser 9) and p-tau/total tau (p- 
tau/t-tau) compared to NDC samples (aGSK-3B, P < 0.0001, 0.0005 and 
0.0001; p-tau/total tau, P< 0.0001, 0.0001 and 0.0002). In a-d, n values on 
graphs indicate the number of biological replicates per patient, contributed 
equally by three iPSC lines. e, sAD2 findings verified in two additional iPSC 
lines (sAD2.4 and sAD2.5). sAD2(1-3) indicates findings from initial sAD2 
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Genetic evidence implicates altered or elevated APP processing 
and amyloid-B levels as the driving agents behind familial 
Alzheimer’s disease” and, because of identical neuropathology, spora- 
dic Alzheimer’s disease. However, tau, although not genetically linked 
to Alzheimer’s disease, forms neurofibrillary tangles, which correlate 
better with disease severity than plaque numbers'*. The mechanism by 
which altered APP processing might cause elevated p-tau and 
neurofibrillary tangle pathology is unclear. Tau phosphorylation at 
Thr 231, one of several tau phosphoepitopes, regulates microtubule 
stability’? and correlates with both neurofibrillary tangle number 
and degree of cognitive decline’. To determine if tau phosphoryla- 
tion at Thr 231 is elevated in APP”? and sAD neurons, we measured 
the amount of p-tau(Thr 231) relative to total tau levels in lysates from 
purified neurons from three iPSC lines from each of the NDC, sAD 
and APP”? patients. Neurons from both APP”? patients had signifi- 
cantly higher p-tau/total tau ratios than neurons from NDC lines 
(Fig. 2c). p-Tau/total tau in the two sAD patients mirrored the 
amyloid-f findings: no difference was observed between sAD1 and 
NDC neurons whereas sAD2 neurons had significantly increased 
p-tau/total tau. 
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iPSC lines. For amyloid-f, aGSK-3 and p-tau/total tau, sAD2 remained 
significantly higher than controls (P < 0.0001). No significant difference was 
found between original and secondary sAD2 lines (P = 0.14, 0.44, 0.63). 

f, Strong positive correlations between amyloid-B(1-40), aGSK-3B and p-tau/ 
total tau in purified neurons. Pearson R = 0.94, 0.91 and 0.83, respectively. 

g, Twenty-four hour treatment with B- and y-secretase inhibitors reduced 
secreted amyloid-B(1-40) compared to control treatment. f-Secretase 
inhibitors partially rescued aGSK-36 and p-tau/total tau in sAD2 and APP”?2 
neurons (P < 0.01 for aGSK-3B, P < 0.03 for p-tau). y-Secretase inhibition did 
not significantly affect aGSK-3B and p-tau/total tau. In g, number of treatment 
sets is indicated on the graph (n), NDCs are represented by two iPSC lines each 
and sAD2 and APP”??2 are represented by three. Error bars indicate s.e.m. 
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Tau can be phosphorylated by multiple kinases. The kinase GSK-3B 
can phosphorylate tau at Thr 231 in vitro and co-localizes with neuro- 
fibrillary tangles and pre-tangle phosphorylated tau in sAD post- 
mortem neurons”. GSK-3f is thought to be constitutively active but 
is inactivated when phosphorylated at Ser 9 (ref. 23). To determine if 
iPSC-derived neurons with elevated p-tau have increased GSK-3f 
activity, the proportion of aGSK-38 in purified neurons was calculated 
by measuring the amount of GSK-3B lacking phosphorylation at Ser 9 
relative to total GSK-3B levels. We observed that neurons from patients 
APP”?1, APP”P2 and sAD2 had significantly higher aGSK-3B than 
NDC neurons (Fig. 2d). The amyloid-f, GSK-3f and tau findings of 
sAD2 were verified by analysing an additional two iPSC lines (sAD2.4 
and sAD2.5; characterization in Supplementary Fig. 10), and we 
observed that levels remained consistently elevated (Fig. 2e). Results 
are detailed per patient in Supplementary Table 4a, per cell line in 
Supplementary Fig. 11, and per cell culture in Supplementary Table 5. 

Although amyloid-B, p-tau and GSK-3B clearly have roles in 
Alzheimer’s disease pathogenesis, their relationship is unclear. We 
observed that iPSC-derived neurons exhibited strong or very strong 
correlations between amyloid-[B(1-40), p-tau/total tau and aGSK-3B 
levels (Fig. 2f and Supplementary Table 4b). We reasoned that if APP 
proteolytic products, such as amyloid-f or carboxy-terminal fragments 
(CTFs), have a causative role in p-tau and aGSK-3f elevation, then 
inhibiting y- or B-secretase activity could reduce p-tau and aGSK-3f. 
We treated purified neurons from NDC1, NDC2, sAD2 and APP?P2 
(2-3 iPSC lines each) with y-secretase inhibitors (CPD-E and DAPT) or 
B-secretase inhibitors (BSi-II and OM99-2) for 24h and measured 
amyloid-B, GSK-3B and p-tau/total tau compared to vehicle-treated 
samples. All inhibitors reduced amyloid-B(1-40) by similar levels 
(32-45% in patient samples) (Fig. 2g). Intriguingly, for both sAD2 
and APP?P2 neurons, we observed that -secretase inhibitors signifi- 
cantly reduced aGSK-3f and p-tau/total tau (Fig. 2g, and shown per 
iPSC line in Supplementary Fig. 12). Neither y-secretase inhibitor 
significantly differed from control-treated samples for aGSK-3f levels 
and p-tau/total tau. 

We extended phenotypic characterization of sAD2 and APP”? by 
analysing endosomal and synaptic markers in FACS-purified neurons 
co-cultured with astrocytes for 12 days. Accumulation of large RAB5* 
early endosomes in neurons has been observed in autopsies from 
sporadic Alzheimer’s disease and some forms of familial Alzheimer’s 
disease**”’. As B-secretase is localized to endosomes and has an acidic 
pH optimum, it has been proposed that early endosomes potentially 
mediate the effects of APP processing on downstream pathologies 
such as increased p-tau, neurofibrillary tangles, synaptic loss and 
apoptosis*®; however, these hypotheses have been difficult to test 
directly without live, patient-specific neurons. To determine if early 
endosome phenotypes are present in iPSC-derived neurons from 
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Alzheimer’s disease patients, purified neurons from NDC1, NDC2, 
sAD2 and APP”?2 (two iPSC lines each) co-cultured with astrocytes 
were harvested and large and very large Rab5* early endosomes (1- 
2.1m? and 2.1-7 1m’) in neuronal soma were counted. Whereas 
control neurons generally had few Rab5" structures >1 um’, neurons 
from both sAD2 and APP”?2 frequently had Rab5~ early endosomes 
highly similar in volume, morphology and localization to what has 
been observed in autopsy samples (Fig. 3a—c). When compared, the 
neurons from both sAD2 and APP”?2 had significantly increased 
numbers of both large and very large early endosomes relative to 
controls (Fig. 3d). We sought to determine if neuronal cultures from 
sAD2 and APP”?2 also contained reduced levels of the presynaptic 
marker synapsin I. In Alzheimer’s disease autopsies, synaptic loss is 
one of the strongest pathological correlates with dementia severity, and 
in regions of the brain affected by Alzheimer’s disease, the presynaptic 
marker synapsin I is decreased in patients versus controls’’**. To 
analyse synapsin I levels in iPSC-derived neurons co-cultured with 
astrocytes, we quantified synapsin I* puncta on MAP2* dendrites 
(Fig. 3e). We found no significant difference between NDCs and either 
sAD2 or APP”?2 in the number of puncta per tm dendrite (Fig. 3f). 
Extended culture periods may be required to study Alzheimer’s dis- 
ease-associated loss of synaptic proteins. 

The results of this study provide strong evidence that iPSC technology 
can be used in concert with post-mortem samples and animal models to 
study early pathogenesis and drug response in sporadic and familial 
Alzheimer’s disease. In purified, electrophysiologically active neurons 
from one sporadic Alzheimer’s disease and two APP’? patients, each 
represented by at least three clonally derived iPSC lines, we observed 
significantly increased levels of three major biochemical markers of 
Alzheimer’s disease: amyloid-B(1-40), aGSK-3B and p-tau/total tau. 
Increased sAD2 amyloid-B levels were not observed in the parental 
fibroblasts, suggesting a cell-type-specific phenotype. Among the indi- 
viduals in this study, not only did strong correlations exist between 
amyloid-$(1-40), p-tau/total tau and aGSK-3f, but both p-tau/total 
tau and aGSK-36 levels were also partially rescued in neurons from 
sAD2 and APP”? following treatment with B-secretase inhibitors, sug- 
gesting that the APP processing pathway has a causative role in tau 
Thr 231 phosphorylation in human neurons. Because y-secretase 
inhibition did not cause a significant effect, products of APP processing 
other than amyloid-f may have a role in induction of GSK-3f activity 
and p-tau. One potential culprit is the B-CTF, the levels of which cor- 
relate with axonopathies in mouse models harbouring APP duplica- 
tions”? and mediate early endosome accumulation in human Down’s 
syndrome fibroblasts’. The observation that neurons from patients 
sAD2 and APP”? have early endosome phenotypes raises the question 
of how aberrant early endosomes relate to other phenotypes of 
Alzheimer’s disease, such as axonopathies, synaptic loss and cell death, 


Figure 3 | Analysis of early endosome and 
synapsin levels in purified neurons co-cultured 
with astrocytes. a-c, Extended focus images of 
Rab5-stained neuronal soma from NDC1, sAD2 
and APP?P2. Arrowhead in b marks a 1-2.1 jum? 
early endosome, and the arrow marks a 2.1-7 um? 
early endosome. Scale bars, 10 jim. d, Neurons 
from sAD2 and APP”?2 have significantly 
increased numbers of large and very large early 
endosomes compared to NDC neurons 

(P < 0.0001, n = 40 neurons from two iPSC lines 
per individual). e, Representative image of synapsin 
I (green) ona MAP2°* dendrite (red). Arrowhead 
marks a synapsin I~ punctum. Scale bar, 3 um. 

f, No significant difference between patients and 
controls in the number of synapsin” structures per 
jim dendrite (P = 1.00, n = 40 dendrites from two 
iPSC lines per individual). Neurons were scored 
blinded to genotype. Error bars indicate s.e.m. 
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in human neurons. Neurons and synapses rely heavily on endocytic 
pathways, and thus iPSC technology can now be used to study the role 
of this dynamic process in live patient-specific neurons. One point of 
caution is that it is possible that the cultures of purified neurons that we 
generated and studied may not have been fully mature, as they lacked 
repetitive action potentials and had limited spontaneous activity. 
Although some types of mature neurons also have these properties, it 
is conceivable that the phenotypes we observed might be modified by 
duration of in vitro culture. In this context, while there is debate about 
when Alzheimer’s disease phenotypes initiate, evidence exists that 
Alzheimer’s disease-like pathology can occur in Down’s syndrome 
fetuses as early as 28 weeks of gestation”. 

Our finding that the genome of patient sAD2, but not patient sAD1, 
generates significant Alzheimer’s disease phenotypes in purified neurons 
has important implications. First, this finding suggests that an unknown 
frequency of sporadic Alzheimer’s disease patients will have genomes 
that generate strong neuronal phenotypes. The frequency of such 
genomes in the sporadic Alzheimer’s disease population cannot be 
determined from the small sample size we report and will require a 
larger sample size to ask how frequent such genomes are in the clinical 
population diagnosed with sporadic Alzheimer’s disease. Second, the 
genome of sAD2 clearly harbours one or more variants that generate 
Alzheimer’s disease phenotypes, which can thus be elucidated by 
future molecular genetic studies. Third, we speculate that sporadic 
Alzheimer’s disease might be sub-divided depending on whether 
neurons themselves are altered, as in the case of sAD2, as opposed to 
other cell types such as astrocytes, which could be altered in other cases, 
for example, sAD1. Thus, future iPSC studies examining larger 
numbers of patients and controls have the potential to provide great 
insight into the mechanisms behind the observed heterogeneity in 
sporadic Alzheimer’s disease pathogenesis, the role of different cell 
types, patient-specific drug responses, and prospective diagnostics. 


METHODS SUMMARY 


iPSC generation and differentiation. Primary fibroblast cultures were established 
from dermal punch biopsies taken from individuals following informed consent and 
Institutional Review Board approval. To generate iPSCs, fibroblasts were transduced 
with MMLYV vectors containing the complementary DNAs for OCT4, SOX2, KLF4, 
c-MYC and + EGFP. IPSC-derived NPCs were differentiated for 3 weeks, neurons 
were purified by FACS, and amyloid-f, p-tau/total tau and aGSK-38 were measured 
on purified control and mutant neurons from multiple lines cultured in parallel for 
an additional 5 days by multi-spot electrochemiluminescence assays (Meso Scale 
Diagnostics). Early endosomes were analysed by confocal microscopy on purified 
neurons co-cultured with human astrocytes (Lonza) for 12 days. To ensure repro- 
ducible and consistent data, we found that it is important to differentiate and evaluate 
neurons from full sets of mutant and control iPSC lines together. 

Statistics. P<0.05 was considered statistically significant. Individuals were 
statistically compared to the total NDC pool by Tukey’s test. Drug responses were 
compared to controls by Dunnett’s method. N values signify the total number of 
separate cultures analysed, with each iPSC line contributing equally to the total. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Patients and fibroblast derivation. NDC and sAD individuals were enrolled in 
the longitudinal study at the UCSD Alzheimer’s Disease Research Center. APP”? 
individuals were patients at the Department of Clinical Medicine, Neurology, 
Oulu University Hospital, Oulu, Finland. For all individuals, dermal punch 
biopsies were taken following informed consent and Institutional Review Board 
approval. Primary fibroblast cultures were established from biopsies using 
established methods*'. Fibroblasts were cultured in DMEM containing 15% 
FBS, L-glutamine and penicillin/streptomycin. 

iPSC generation and expansion. iPSCs were generated as described”, with the 
following modifications. The cDNAs for OCT4, SOX2, KLF4, c-MYC and EGFP were 
cloned into pCX4 vectors*’ and vectors were packaged into VSVG-pseudotyped 
retroviruses. For each patient, three independent viral transductions were performed. 
Three wells, each containing ~1 X 10° fibroblasts, were transduced with retroviruses. 
On days 2-8, 2 mM valproic acid was added to cultures. Potential iPSC colonies were 
picked at ~3 weeks and transferred to 96-well plates containing irradiated mouse 
embryonic fibroblasts (MEFs). For passaging, cells were dissociated with TrypLE 
(Invitrogen). The efficiency of potential iPSC colony formation was roughly 100 
colonies per 1 X 10° fibroblasts at 3 weeks. The efficiency of successful establishment 
of a stable iPSC line from an initial colony was roughly 10%. 

Karyotype analysis and pluripotency assays. Karyotype analysis was performed 
by Cell Line Genetics. IPSCs were assayed for teratoma formation by injections 
into spinal cords of athymic rats, as previously described**, with the following 
modifications: cells were dissociated with Accutase (Innovative Cell Technologies) 
and passed through a 100-j1m mesh filter before injections and each animal 
received 10 injections of roughly 10,000 cells. For in vitro pluripotency assays, 
iPSC cultures were dissociated with dispase, and embryoid bodies were generated 
in low-attachment plates in media containing 15% fetal bovine serum (FBS). After 
7 days, cultures were plated on Matrigel (BD Biosciences)-coated glass coverslips 
and cultured for 7 days. 

Genotyping and qPCR. To determine APP copy number, genomic DNA was 
isolated from fibroblasts or differentiated NPCs. qPCR was performed using 
FastStart Universal SYBR Green Master Mix (Roche) and analysed on an 
Applied Biosystems 7300 Real Time PCR System using the AAC; method. APP 
levels were normalized to mean f-globin/albumin. To compare RNA levels 
between samples, RNA was purified (PARIS kit, Ambion), DNase treated 
(Ambion) and reverse transcribed (Superscript II, Invitrogen). For transgene 
expression, primers detected a sequence common to all transgenes, and expression 
was normalized to the housekeeping gene NONO. PCR to detect endogenous 
SOX2 expression was performed using Qiagen HotStarTaq and primers previously 
described”. 

Immunocytochemistry and microscopy. Cells were fixed in 4% paraformaldehyde, 
permeabilized with buffer containing TritonX-100 and stained with primary and 
secondary antibodies (see below). Samples, except for early endosome studies, were 
imaged on a Nikon TE2000-U inverted microscope and acquired using Metamorph 
software (Molecular Devices). ImageJ software (National Institutes of Health) was 
used to pseudo-colour images, adjust contrast, and add scale bars. Endosomes and 
synapses were imaged on a PerkinElmer UltraVIEW VoX microscope with a X60 
objective and a z-step of 0.5 tm. Quantifications were done blinded to genotype. 
Antibodies. The antibodies used for FACS purification of cells were TRA1-81- 
APC, CD184-APC, CD15-FITC, CD24-PECy7, CD44-PE and CD271-PE (all 
from BD Biosciences) and were used at a concentration of one test per 1 X 10° 
cells. The following antibodies were from Millipore: AFP (mouse 1:1,000), Appt 
(22C11, ms 1:1,000), SMA (ms 1:50), SOX2 (rb 1:2,000), synapsin I (rb 1:500); 
from Sigma: GABA (rb 1:200), MAP2a/b (ms 1:500), tau total (rb 1:500), tau 
Thr 231 (rb 1:150). Other vendors: APP°' (Zymed rb 1:500), GAPDH 
(Ambion ms 1:250), VGluT1 (Synaptic Systems rb 1:200), NANOG (Santa- 
Cruz rb 1:200), nestin (Santa-Cruz rb 1:1,000), RAB5A (Santa-Cruz rb 1:50), 
synuclein (BD ms 1:500), tau PHF1 (Pierce ms 1:500), BII-tubulin (Covance 
ms 1:2,000), BIl-tubulin (Covance rabbit 1:1,000) anti-rabbit Alexa Fluor 488 
(Invitrogen 1:200) and anti-mouse Alexa Fluor 568 (Invitrogen 1:200). 
Neuronal differentiation and FACS. To ensure reproducible and consistent data, 
we found that it is important to differentiate and evaluate neurons from full sets of 
mutant and control iPSC lines together. Differentiation to NPCs and neurons was 
performed as previously described'”. 3 x 10° FACS-purified TRA1-81~ cells were 
seeded onto 3 X 10 cm plates that were seeded the previous day with 5 X 10° PA6 
cells’. At day 11, cells were dissociated with Accutase and ~5 xX 10° 
CD184*CD15"CD44CD271~ NPCs were FACS-purified and plated onto 
poly-ornithine/laminin-coated plates and cultured with bFGF. At passage 7, 
NPCs were differentiated with BDNF, GDNF and cAMP. After 3 weeks of differ- 
entiation, cells were dissociated with Accutase and CD24*CD184 CD44° cells 
were purified. FACS was performed with a FACSAria II (BD Biosciences) and 


analysed with FloJo (Tree Star). Differentiation methods are also summarized in 
Supplementary Fig. 6. 

Gene expression profiling. Total RNA was extracted from collected sample 
pellets (Ambion mirVana; Applied Biosystems) according to the manufacturer’s 
protocol. RNA quantity (Qubit RNA BR Assay Kits; Invitrogen) and quality 
(RNA6000 Nano Kit; Agilent) was determined to be optimal for each sample 
before further processing. 200ng RNA per sample was amplified using the 
Illumina Total Prep RNA Amplification Kit according to manufacturer’s protocol 
and quantified as above. 750 ng biotinylated RNA per sample was hybridized to 
Illumina HT-12v4 Expression BeadChips, scanned with an Illumina iScan Bead 
Array Scanner, and quality controlled in GenomeStudio and the lumi bioconductor 
package. All RNA processing and microarray hybridizations were performed in 
house according to manufacturer’s protocols. In GenomeStudio, probes were 
filtered for those detected with a P value of 0.01 in at least one sample and exported 
for normalization in R. Raw probe expression values were transformed and 
normalized using the robust spline normalization (RSN) as implemented in the 
lumi R/Bioconductor package. Probes were further filtered for those having a 
minimum value of 150 in at least two samples and a minimum difference between 
any two samples (maximum value minus minimum value) of at least 150. 
Electrophysiology methods. Electrophysiology was performed on purified 
neurons, 5 days after FACS. For whole-cell patch-clamp recordings, individual 
coverslips were transferred into a heated recording chamber and continuously 
perfused (1 ml min ') with artificial cerebrospinal fluid (ACSF) bubbled with a 
mixture of CO, (5%) and Oz (95%) and maintained at 25 °C. The ACSF contained 
124mM NaCl, 3 mM KCl, 1.3 mM MgSO,, 26 mM NaHCoO;, 1.25 mM NaHPO,, 
20 mM glucose and 2mM CaCl; (all chemicals from Sigma). For targeted whole- 
cell recordings, we used a X 40 water-immersion objective, differential interference 
contrast filters (all Olympus), a digital camera (Rolera XR -Qimaging), a halogen 
(Olympus), a digidata 1440A/ Multiclamp 700B and Clampex 10.3 (Molecular 
devices). The resistance of the patch electrodes was between 3-5 MOhm. Patch 
electrodes were filled with two different internal solutions both containing 4mM 
NaCl, 10mM Na-HEPES, 10mM p-glucose, nucleotides (0.3mM GTP, 2mM 
Mg-ATP, 0.2mM cAMP) 0.15% biocytin and 0.06% rhodamine. For current- 
clamp experiments, the internal solution also contained 130mM K-gluconate, 
6mM KCl and 0.2 mM K-EGTA; in all other experiments, it contained instead: 
126 mM Cs-gluconate, 6 mM CsCl and 0.2 mM Cs-EGTA. The pH and osmolarity 
of the internal solution were close to physiological conditions (pH 7.3, 290- 
300mOsm). Data were all corrected for liquid junction potentials (10 mV). 
Electrode capacitances were compensated on-line in cell-attached mode 
(~7 pF). Recordings were low-pass filtered at 2 kHz, digitized, and sampled at 
intervals of 50 [1s (20 kHz). To control the quality and the stability of the record- 
ings throughout the experiments, access resistance, capacitance and membrane 
resistance were continuously monitored on-line and recorded. The access resist- 
ance of the cells in our sample was 21 + 1 MOhm. Electrophysiological statistical 
analysis was assisted with Clampfit 10.3, Igor Pro 6, Prism 5 and Microsoft Excel. 
Mean = standard error of the mean were reported. 

Amyloid-B, p-tau/total tau and aGSK-3f measurements. FACS-purified 
neurons were plated at 2 X 10° per well of a 96-well plate. Cells were cultured 
for an additional 5 days with a full media change on day 3. Amyloid-B was 
measured with MSD Human (6E10) Abeta3-Plex Kits (Meso Scale Discovery). 
p-tau/total tau was measured with a MSD Phospho(Thr231)/Total Tau Kit. aGSK- 
3B was measured with MSD Phospho/Total GSK-3b Duplex Kit. Fibroblast and 
neuronal amyloid-B levels were normalized to total protein levels determined by 
BCA assay (Thermo Scientific). aGSK-3f (the per cent of unphosphorylated GSK- 
36 at Ser 9) was calculated by manufacturer’s recommendations: (1 — (2 X phospho 
signal)/(phospho signal + total signal)) x 100. 

Inhibitor treatments. CPD-E (Compound-E) and DAPT were used at a final 
concentration of 200 nM. BSi-II (B-secretase inhibitor II) and OM99-2 were used 
at 10 1M and 750 nM, respectively. 1 pil of inhibitor or vehicle was added to the 
existing culture media of parallel cultures on day 4 and cultures were harvested on 
day 5. All inhibitors were from EMD Chemicals and were dissolved in DMSO. 
Endosomal analysis. 1.5 X 10° per FACS-purified neurons were plated per well of 
a 96-well plate that was seeded the previous day with 5,000 human astrocytes 
(Lonza). After 12 days of culture, cultures were stained for RAB5 and BIII-tubulin 
and imaged on a PerkinElmer UltraVIEW VoX microscope with a X60 objective 
and a z-step of 0.5 um. Quantification was performed blinded to genotype with 
Volocity software (PerkinElmer) on BIlI-tubulin® cells only. 

Statistics. Data were analysed using JMP software (SAS Institute). P< 0.05 was 
considered statistically significant. Individuals were statistically compared to the 
total NDC pool by ANOVA followed by Tukey’s test. N values signify the total 
number of separate cultures analysed, with each iPSC line contributing equally to 
the total. Drug responses were compared to controls by Dunnett’s method. 
Correlations were determined by calculating Pearson coefficients (R). 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


31. Takashima, A. in Current Protocols in Cell Biology Ch. 2 12 (John Wiley & Sons, 34. Emre, N. et al. The ROCK inhibitor Y-27632 improves recovery of human 
2001). embryonic stem cells after fluorescence-activated cell sorting with multiple cell 
32. Park, |.-H., Lerou, P. H., Zhao, R., Huo, H. & Daley, G. Q. Generation of human- surface markers. PLoS ONE 5, e12148 (2010). 
induced pluripotent stem cells. Nature Protocols 3, 1180-1186 (2008). 35. Takahashi, K. et al. Induction of pluripotent stem cells from adult human 
33. Akagi, T., Sasai, K. & Hanafusa, H. Refractory nature of normal human diploid fibroblasts by defined factors. Cell 131, 861-872 (2007). 
fibroblasts with respect to oncogene-mediated transformation. Proc. Natl Acad. 36. Kawasaki, H. et al. Induction of midbrain dopaminergic neurons from ES cells by 
Sci. USA 100, 13567-13572 (2003). stromal cell derived inducing activity. Neuron 28, 31-40 (2000). 


©2012 Macmillan Publishers Limited. All rights reserved 


NEWS & VIEWS 


doi:10.1038/nature10869 


PLANT BIOLOGY 


Equal-parenting policy 


During early embryo development in animals, maternal genes are expressed in preference to those of the zygote — the 
newly fertilized egg. But in plants it seems that zygote genomes switch on within hours of fertilization. 


CHRISTOPHER J. HALE 
& STEVEN E. JACOBSEN 


ertilization occurs when two gametes 
merge to form a zygote. The zygote’s 


genome comprises two sets of chromo- 
somes — one maternal, the other paternal. In 
animals, the stages of development immedi- 
ately after fertilization are disproportionally 
controlled by maternally inherited factors, 
with the zygotic genome being switched on 
gradually in waves of activation’. Whether 
plants undergo a similar maternal-to-zygotic 
transition in genomic activity has been a long- 
standing question. In contrast to previous 
work’, Nodine and Bartel’ show, in a paper 
published on Nature’s website today, that a 
plant zygote’s transcriptome — its total com- 
plement of protein-coding RNA transcripts — 
contains relatively equal maternal and paternal 
contributions. It seems, therefore, that plants 
and animals have evolved distinct strategies for 
managing the early steps in the transition from 
two gametes to an organism with maternal and 
paternal copies of the genome. 

For the plant species Arabidopsis thaliana, a 
wealth of whole-genome sequence data exists 
for closely related yet genetically distinct lines 
known as ecotypes*. To directly measure 
the respective contributions of the maternal 
and paternal genomes to the zygotic tran- 
scriptome, Nodine and Bartel crossed two of 
these ecotypes and sequenced the RNA con- 
tent of the resulting zygote. Using the genetic 
differences between the parents, they then 
matched these RNAs to either the maternal or 
the paternal genome. 

The authors observed a near-equal abun- 
dance of RNA sequences derived from each 
parental genome in the plant embryos as early 
as the one- to two-cell stage. Only a small num- 
ber of genes showed biased expression. They 
also performed reciprocal crosses — switching 
which ecotype provided the male and female 
gamete. It emerged that most of these cases of 
bias were probably due to genetic or epigenetic 
differences (chemical modifications that alter 
gene expression without affecting the DNA 
sequence) between the two ecotypes used, 
rather than being a result of which ecotype was 
used as the male or the female parent. These 
findings imply that the plant zygotic genome 
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Figure 1 | Maternal-to-zygotic genomic transition in plants and animals. a, In animals, a single 
fertilization event between maternal and paternal gametes forms the zygote (not shown). Initially, the 
animal embryo is under predominant control of the maternal genome, and the zygotic genome is only 
gradually activated over the course of embryogenesis. b, In plants, a double fertilization process generates 
the zygote and the endosperm tissue. Both the endosperm and the embryo are encased in maternal 

tissue that generates the seed coat. Chromosome symbols represent maternal (red) and paternal (blue) 
genomes. Nodine and Bartel’ show that, in contrast to the case for animals, the plant zygotic genome is 


activated almost immediately after fertilization. 


is essentially switched on only hours after 
fertilization. 

Nodine and Bartel’s results are in contrast 
to other reports” suggesting that the mater- 
nal-to-zygotic transition in plants is gradual, 
as is the case for animals. The most compre- 
hensive of these reports’, published last year, 
used a genomics approach similar to that of 
the current study, and found that more than 
80% of the Arabidopsis transcriptome in early 
embryos is derived from the maternal genome. 

What might explain such different results 
between the two studies? Nodine and Bar- 
tel’ suggest that the most likely answer is 
that the embryo samples of the earlier study 
were contaminated by maternal tissue. This 
is because, in Arabidopsis as in other flow- 
ering plants, the embryo is surrounded bya 
seed coat and other maternally derived tissue 
(Fig. 1), making the isolation of high-purity 
embryonic tissue difficult — a problem for 
the extremely sensitive genome-sequencing 
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techniques used in both studies. After observ- 
ing this maternal-contamination effect in 
pilot studies, Nodine and Bartel’ overcame the 
problem by extensively washing the isolated 
embryo cells. 

One question arising from Nodine and 
Bartel’s study is why activation of the zygotic 
genome is so different between plants and 
animals. Because the two kinds of organisms 
evolved multicellularity independently’, and 
have very different life histories, it is perhaps 
not surprising that the maternal-to-zygotic 
transition also differs between them. In plants, 
gametes are generated from cells derived from 
a pool of undifferentiated cells that are also 
responsible for generating structures such as 
leaves’*, rather than from a distinct germ-cell 
lineage as occurs in animals. In addition, ferti- 
lization itself is radically different in flowering 
plants compared with animals, with each seed 
being the product of two fertilization events 
(Fig. 1b). In this process, one fertilization event 
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forms the endosperm, a tissue that is function- 
ally similar to the mammalian placenta, and 
which contains two sets of maternal chromo- 
somes and one set of paternal chromosomes. 
A second fertilization event results in the 
formation of the embryo. It is noteworthy 
that, although Nodine and Bartel show that 
the embryo experiences equal transcriptome 
contributions from both parental genomes, 
gene expression in the embryo-nourishing 
endosperm shows extensive parental influ- 
ence’; these effects are due to uneven nuclear 
‘dosage’ as well as to epigenetic imprinting 
effects. 

Another question is how the early plant 
embryo coordinates rapid integration of two 
genomes and the concomitant activation of a 
resulting zygotic genome. In animals, a suite 
of mechanisms acts to clear maternal fac- 
tors, such as proteins and RNAs, from the 
developing zygote and to activate the zygotic 
genome”. It is unclear whether similar 
mechanisms operate in plants. Tantalizingly, 
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epigenetic processes such as DNA methyla- 
tion and demethylation, as well as regulatory 
small RNA molecules, have recently been 
implicated’ in the control of both gamete and 
endosperm development. Future studies of 
these pathways may reveal mechanisms for 
the regulation of gene expression in embry- 
onic plants. 

Understanding how a functional plant 
genome so quickly emerges from two progeni- 
tor genomes is vital for understanding plant 
development, and for informing approaches 
to plant breeding and plant biotechnology. In 
many crop species, combining two different 
parental genomes can generate regular and 
predictable hybrid vigour, known as hetero- 
sis. In some species this vigour is obvious very 
early in development’. Consistent with these 
observations, Nodine and Bartel’s work’ sug- 
gests that zygotic-genome dynamics in plants, 
including, perhaps, some aspects of hetero- 
sis, are established almost immediately after 
fertilization. m 
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Crystal structure of the channelrhodopsin 
light-gated cation channel 


Hideaki E. Kato!, Feng Zhang’, Ofer Yizhar?, Charu Ramakrishnan’, Tomohiro Nishizawa', Kunio Hirata’, Jumpei Ito’, 
Yusuke Aita‘*, Tomoya Tsukazaki!, Shigehiko Hayashi°, Peter Hegemann’, Andrés D. Maturana*, Ryuichiro Ishitani!, 


Karl Deisseroth? & Osamu Nureki! 


Channelrhodopsins (ChRs) are light-gated cation channels derived from algae that have shown experimental utility in 
optogenetics; for example, neurons expressing ChRs can be optically controlled with high temporal precision within systems 
as complex as freely moving mammals. Although ChRs have been broadly applied to neuroscience research, little is known 
about the molecular mechanisms by which these unusual and powerful proteins operate. Here we present the crystal 
structure of a ChR (a C1C2 chimaera between ChR1 and ChR2 from Chlamydomonas reinhardtii) at 2.3 A resolution. The 
structure reveals the essential molecular architecture of ChRs, including the retinal-binding pocket and cation conduction 
pathway. This integration of structural and electrophysiological analyses provides insight into the molecular basis for the 
remarkable function of ChRs, and paves the way for the precise and principled design of ChR variants with novel properties. 


Organisms ranging from archaebacteria to human beings capture 
energy and/or information contained within environmental sources 
of light by using photoreceptors called rhodopsins, which consist of 
seven-transmembrane-helix proteins, called opsins, covalently linked 
to retinal. On the basis of primary sequences, the corresponding opsin 
genes are classified into two groups: microbial (type I) and animal (type 
II). Type I opsin genes are found in archaea, eubacteria, fungi and algae, 
whereas type II opsin genes are expressed in animals, including human 
beings. The type II proteins indirectly influence transmembrane ion 
currents by coupling to G-protein-based signal transduction pathways. 
In contrast, the type I proteins (not normally found in animals) include 
direct-light-activated regulators of transmembrane ion conductance, 
such as the light-driven ion pumps called bacteriorhodopsins and 
halorhodopsins (BRs and HRs)’? and the light-driven ion channels 
ChRs’. The light-driven ion pumps have been extensively studied, 
and structure-function relationships are well known. As opposed to 
these ion pumps, very little is known about the structure of ChRs or the 
mechanism by which these seven-transmembrane proteins conduct 
cations in a light-dependent manner’. 

Beginning in 2005, it was found that ChRs could be expressed in 
mammalian neurons to mediate precise and reliable control of action 
potential firing in response to light pulses, without the need for 
exogenous retinal in vertebrate systems* °. ChRs have now been used 
to control neuronal activity in a wide range of animals, resulting in 
insights into fundamental aspects of circuit function as well as dys- 
function and treatment in pathological states'®''. However, despite 
the rapid progress of optogenetics (a technology also encompassing 
the use of ion pumps, such as HRs), virtually nothing is known about 
how a seven-transmembrane protein can form a light-switchable 
channel for cations. Although a rough helical arrangement was visible 
in the recently published ChR2 electron microscopy structure of two- 
dimensional crystals at 6 A resolution, amino acid positioning and 
insights into channel function remained completely lacking”’. A high- 
resolution three-dimensional image would be of enormous value, not 
only to enhance understanding of microbial opsin-based channels, 


but also to guide optogenetics in the generation of ChR variants 
with novel function related to spectrum, selectivity and kinetics. 
Even with limited structural models, ChR variants have been 
engineered with faster or extended open-state lifetimes'*""’, shifted 
absorption spectra’*’”"’, reduced desensitization®'**°, and increased 
expression and photocurrent magnitude*!*’”'°. These advances 
represent the tip of the iceberg in terms of what could be achieved 
for all of the above properties, as well as for altered ion selectivity and 
unitary (single-channel) conductance properties, if detailed structural 
knowledge could be obtained to facilitate true electrostatic calcula- 
tions and molecular dynamics simulations. 

Here we present the ChR crystal structure at 2.3 A resolution. This 
high-resolution information, along with electrophysiological analyses, 
has revealed the fundamentals of ChR architecture and guides the way 
to a basic working model for channelrhodopsin function. 


Overall ChR structure 


ChR2 from C. reinhardtii consists of 737 amino acids; the seven trans- 
membrane domains (TMs) and photocurrent functionality are all 
contained within the amino-terminal ~300 amino acids. To identify 
the most promising candidates for structural studies, we constructed 
and explored an extensive range of different ChRs and ChR chimaeras 
with distinct carboxy-terminal truncations. Using fluorescence- 
detection size-exclusion chromatography (FSEC)”’, we found that a 
novel chimaeric and truncated sequence termed C1C2, primarily 
consisting of ChR1 (ref. 3) without its C terminus and with the last 
two TMs swapped for those from ChR2 (related to previous 
chimaeras’*”°”* but with an additional six-amino-acid modification 
of the C terminus, namely removal of the sequence NKGTGK), was 
not only expressed well in Sf9 insect cells but also showed good stability 
and monodispersity as well as similar spectral characteristics to pre- 
vious chimaeras” (Supplementary Fig. 1). The crystals obtained from 
fully dark-adapted C1C2 in the lipidic cubic phase belonged to the 
C222, space group and diffracted X-rays to 2.3 A resolution. We solved 
the C1C2 structure by the multiple anomalous dispersion (MAD) 
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method, using mercury-derivatized crystals (Supplementary Fig. 2). As 
far as we know, this is the first example of the phase determination by 
MAD for the crystal obtained in the lipidic cubic phase. 

The truncated C1C2 chimaera (residues 1-342) is composed of an 
N-terminal extracellular domain (N domain, residues 24-83, marked in 
Fig. 1a, d), the seven TMs (TM1-TM7; residues 84-317) connected by 
three cytoplasmic loops (ICL1-ICL3) and three extracellular loops 
(ECL1-ECL3), and the C-terminal intracellular domain (C domain, 
residues 318-356) (Fig. 1b, d). In addition to the region spanning the 
N-terminal residues 1-23, which is processed as a signal peptide (data 
not shown), residues 24-48, 110-117 and 343-356 are structurally 
disordered and invisible in the electron density map, whereas the core 
transmembrane region is clearly resolved (Fig. la). Searches of the 
Protein Data Bank using the Dali server (http://ekhidna.biocenter. 
helsinki.fi/dali) suggested that the N domain, consisting of a short 
3i9-helix and two f-strands, has a novel fold. Within each C1C2 
protomer, 6 lipids and 43 water molecules were observed. 

Two C1C2 protomers were found to be tightly associated into a 
closely apposed dimer, as previously predicted from electron micro- 
scopy’. Interfacial interactions occur in the N domain, ECL1, TM3 
and TM4 of each molecule (Fig. 1b, c). Notably, Cys 66 (27), Cys 73 
(34) and Cys 75 (36) in the N domain (ChR2 numbering is shown in 
parentheses here and below for comparison with earlier literature) 
form three disulphide bonds between protomers. As Cys 73 and 
Cys 75 are highly conserved in ChRs, this interaction may contribute 
to stabilizing the N-domain interaction and molecular dimerization 
(Supplementary Fig. 3). 


Structural comparison with BR 


We next compared the C1C2 structure with that of BR and bRh 
(bovine rhodopsin). The primary sequence of ChR is similar to that 


«+ k *s 
: Sod 
Py 
ECL1 4 
a N domain 
d C domain 


N domain 


Figure 1 | Structure of C1C2 and comparison with BR. a-c, Crystal structure 
of the C1C2 dimer, viewed parallel to the membrane from two angles (a, b), and 
viewed from the extracellular side (c). C1C2 consists of the N domain, the seven 
transmembrane helices (TM1-TM7) connected by extracellular loops (ECL1- 
ECL3) and intracellular loops (ICL1-ICL3), and the C domain. Disordered 
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of BR as well as other microbial opsins, such as xanthorhodopsin and 
sensory rhodopsin II (Supplementary Fig. 4). Consistent with this 
sequence similarity, C1C2 superimposed well on BR (PDB accession 
1I1W6)”’, but not on bRh (PDB accession 3C9L)* (Fig. 1d and 
Supplementary Fig. 5). TM3-6 between C1C2 and BRare very similar, 
and the position of the protonated Schiff base is conserved (Fig. le), 
whereas there are three distinct features between the two structures. 
First, C1C2 has additional N-terminal and C-terminal domains. The N 
domain, as noted above, contributes to dimer formation, and the C 
domain may be involved in subcellular localization and scaffolding in 
Chlamydomonas; for example, to tether ChR to the algal eyespot”. 
Second, TM7 of C1C2 protrudes into the intracellular space, projecting 
~18 A from the membrane surface, and the intracellular end of TM7 is 
shifted towards the central axis of the monomer by 2.7 A, as compared 
with BR (Fig. 1d). Although the function of this protruding part of 
TM7 is unclear, His 313, His317 and Gly318 may contribute to 
stabilizing the intracellular C domain via a water-mediated hydrogen- 
bonding network (Supplementary Fig. 6). Last, and most importantly, 
the C1C2 extracellular ends of TM1 and TM2 are tilted outward by 
3.0 A and 4.1 A, respectively, compared to those of BR (Fig. le). These 
tilts enlarge the cavity formed by TM1, 2 and 7 and allow water influx 
for a cation-translocation pathway, as discussed later. 


Retinal-binding pocket, Schiff base and counterion 

As in other microbial-type rhodopsins, all-trans retinal (ATR) is 
covalently bound to Lys 296 (257) on TM7 (Lys 216 in BR), forming 
the Schiff base. As in BR, five aromatic residues (Trp 163, Phe 217, 
Trp 262, Phe 265 and Phe 269) are located around the polyene chain 
and the B-ionone ring, forming a hydrophobic pocket for ATR (Fig. 2a, 
b), whereas Cys 167 (128), Thr 198 (159) and Ser 295 (256) form a less- 
hydrophobic pocket, and may contribute to colour shift (Fig. 2a, b). 


c 


C domain 


TMS 


regions are represented as dotted lines. The ATR is coloured pink. d, e, Side 
view (d) and extracellular view (e) of the superimposed TMs of C1C2 (green) 
and BR (orange). The yellow double arrows indicate the shifts of the 
extracellular parts of TM1 and TM2. 
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Figure 2 | Structural comparison of the retinal-binding pocket between 
C1C2 and BR. a, Structure of the retinal-binding pocket of C1C2, with an omit 
map of ATR at 3o and of the surrounding residues (subtract 39 from the C1C2 
residue number to obtain ChR2 numbering) at 3.50. b, Structure of the retinal- 
binding pocket of BR. 


A previous report suggested that the side chains of Cys 167 (128) 
and Asp 195 (156) (Thr90 and Asp 115 in BR) directly hydrogen 
bond with each other”® (Fig. 2a), and that this interaction may func- 
tion as the molecular switch to direct transition to the conducting 
state. However, in the present C1C2 structure, the distances between 
the thiol group of Cys 167 and the carboxyl oxygen atoms of Asp 195 
are 4.4 A and 4.6 A, respectively, and the thiol group of Cys 167 is not 
associated with Asp 195, but with the m-electron system in the retinal 
molecule (Fig. 2a). 

In BR, a water molecule receives a proton from the protonated 
Schiff base and donates a proton to Asp 85 (ref. 27; Fig. 3b); this 
arrangement is conserved in C1C2. However, in C1C2 the distances 
from the protonated Schiff base are 4.4 A, 3.4 A and 3.0A respectively 
for the water molecule, Glu 162 (123) (Asp 85 in BR) and Asp 292 
(253) (Asp 212 in BR) (Fig. 3a, b). Therefore, in C1C2, Asp 292 or 
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Figure 3 | The protonated Schiff base and its counterions in C1C2 and BR. 
a, b, Structures of the environment around the Schiff base in C1C2 (a) and BR 
(b). Numbers indicate the distance between two atoms connected by dashed 
lines. c, Effects of the mutation of two possible counterions on the 


possibly Glu 162, but not the water located between them, may 
directly receive a proton from the protonated Schiff base. In BR, 
Asp 212 retains a low pK, because it is hydrogen bonded to Tyr 57 
and Tyr 185, which do not move during the photocycle (Fig. 3b). On 
the other hand, in C1C2, Tyr57 and Tyr 185 are substituted with 
Phe 133 and Phe 265, and Asp 292 only forms a hydrogen bond with 
water (Fig. 3a); thus Asp 292 could move relatively freely in the photo- 
cycle. Therefore in C1C2, the pK, of Asp 292 can change, in contrast 
to the corresponding residue in BR. Moreover, the pK, values of 
Glu 162 and Asp 292, calculated using PROPKA** (Supplementary 
Table 2), showed that Glu 162 may be protonated and Asp 292 may 
be deprotonated in our structure. Thus, we suggest that Asp 292, 
rather than Glu 162, is the primary proton acceptor in C1C2, consist- 
ent with the finding that Glu 123 mutants show current amplitudes 
similar to wild type’*'’. To verify further this notion, we expressed the 
E162A and D292A mutants of C1C2 in HEK293 cells, and recorded 
photocurrents in response to 465-nm light pulses (Fig. 3c, d and 
Supplementary Figs 7 and 8), revealing that replacement of Glu 162 
by alanine resulted in moderately decreased currents, whereas the 
substitution of Asp 292 by alanine almost completely abolished 
photocurrents despite robust membrane expression. Moreover, the 
onset time constant (toy) of the D292A mutant was significantly 
larger than that of wild type (Supplementary Fig. 9), consistent with 
the structure showing that Asp 292, rather than Glu 162, may be the 
major proton acceptor from the protonated Schiff base in ChR. 


Electronegative pore framed by four TM helices 

We calculated the electrostatic surface potential of C1C2, which 
revealed an electronegative pore formed by TM1, 2, 3 and 7 
(Fig. 4a). In this pathway, a number of negatively charged residues, 
including Glu 129 (90), Glu 136 (97) and Glu 140 (101), as well as 
Glu162 (123) and Asp292 (253), are aligned along the pore 
(Fig. 4b). Because most of the negatively charged residues are derived 
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photocurrent. Photocurrents on C1C2-expressing HEK293 cells were 
measured at 16 different holding potentials. WT, wild type. d, The peak 
amplitudes of the photocurrents, normalized by the cell’s input capacitance. 
Values are means and s.e.m. of 7-15 experiments. **P < 0.01, ***P < 0.001. 
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Figure 4 | Cation-conducting pathway formed by TMI, 2, 3 and 7. a, Pore- 
lining surface calculated by the CAVER” program, coloured by the electrostatic 
potential. Dashed red lines indicate putative intracellular vestibules. b, Close-up 
views of the surface of the pore, with the 17 polar lining residues (subtract 39 
from the C1C2 residue number to obtain ChR2 numbering). Hydrogen bonds 


from TM2, we suggest that the ion conductance and selectivity of 
C1C2 are mainly defined by TM2. 

On the extracellular side of the pore, a vestibule formed by the N 
domain and ECL1-3 opens up to a diameter of about 8 A, where 
Lys 154 (115), Lys 209 (170) and Arg 213 (174) form a slightly elec- 
tropositive surface around the vestibule (Supplementary Fig. 10). 
Deeper within the vestibule, Arg 159 (120), Tyr 160 (121), Glu274 
(235) and Ser 284 (245) form a weak electronegative surface and fix 
the positions of TM3, 6 and 7 by a water-mediated hydrogen-bond 
network (Supplementary Fig. 11a). As these four residues are highly 
conserved not only in ChRs but also in BRs (Arg 82, Tyr 83, Glu 194 
and Glu 204, respectively), and the corresponding residues in BR 
reportedly have an important role in proton pumping, we generated 
the R159A mutant in C1C2. We found that this mutant did not 
produce a photocurrent despite robust membrane expression 
(Fig. 4c, d and Supplementary Figs 7 and 8); because the orientation 
of Arg 159 is quite different from the corresponding Arg residue in 
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are shown as black dashed lines. c, Photocurrents of mutants of the five residues 
within the pathway, measured under the same conditions as in Fig. 3c. d, The 
peak amplitudes of the photocurrents, as in Fig. 3d. *P < 0.05, ***P < 0.001. 
Error bars represent s.e.m. 


BR, and these residues form the extracellular hydrophilic surface, we 
suggest that this conserved cluster has an important role in creation of 
the extracellular vestibule, rather than in proton movement as in BR 
(Supplementary Fig. 11b). 

In the middle of the pore, 12 polar residues (Gln 95 (56), Thr 98 
(59), Ser 102 (63), Glu 122 (83), Glu 129 (90), Lys 132 (93), Glu 136 
(97), Glu 140 (101), Glu 162 (123), Thr 285 (246), Asp 292 (253) and 
Asn 297 (258)) form a hydrophilic and strongly electronegative 
surface (Fig. 4b). To investigate contributions to ChR function, we 
measured photocurrents, kinetics and selectivity for four mutants 
(Q95A, K132A, E136A, E140A) (Fig. 4c, d and Supplementary Figs 
7-9 and 12). The K132A mutant had faster kinetics and similar 
current amplitude relative to wild type, whereas the Q95A and E140A 
mutants exhibited moderately reduced currents, and the E136A mutant 
showed very little photocurrent, despite robust membrane expression. 
Three of the four mutants (Q95A, K132A and E136A) altered ion 
selectivity (Supplementary Fig. 12); therefore, we suggest that this 
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pore is important for cation conduction and that, as previously sug- 
gested”’, Glu 136 (97) is essential. 

Although this putative cation-conducting pathway is opened 
towards the extracellular side, the cytoplasmic side of this pathway 
is occluded owing to two constrictions (Figs 4b and 5). The first 
constriction is formed by three highly conserved polar residues: 
Ser 102 (63), Glu 129 (90) and Asn 297 (258) (Figs 4b and 5a). In this 
constriction site, Ser 102, with a B-OH group that hydrogen bonds to 
the main-chain carbonyl oxygen of Thr 98, fixes the position of 
Asn 297 and, in turn, Asn 297 fixes Glu129 by hydrogen bonds. 
Glu 129 protrudes into and occludes the pore. To analyse this putative 
channel gate, we prepared four mutants ($102D, E129A, E129Q and 
N297D) and measured photocurrents, kinetics and ion selectivity 
(Supplementary Figs 7-9, 12 and 13). We found that E129A, 
E129Q and N297D affect ion selectivity and $102D, E129Q and 
N297D affect channel kinetics (Supplementary Figs 9 and 12). 
These results, consistent with previous work*”*’, indicate the import- 
ance of these three residues and suggest that cations pass through this 
constriction site in the conducting state. 

The second constriction is formed by the phenol group of Tyr 109 
(70) (Figs 4b and 5b). Given the high B-factor of the C-terminal end of 
TM1 (Supplementary Fig. 14) anda previous Fourier transform infrared 
spectroscopy (FT-IR) study showing that the o-helices undergo con- 
formational changes during the photocycle’, movement of the TM1 
C-terminal end may open the pore exit formed between TM1, 2 and 7. 
As TM1 does not directly interact with the retinal chromophore, the 
signal of retinal isomerization is expected to be transmitted to TM1 via 
movements of TM2, 3 and/or 7. However, we cannot exclude the 
possibility that movements of TM2, 3 and/or 7 form a cytoplasmic 
vestibule next to Tyr 109, and further studies will be required to 
identify the pore exit. 


Discussion 


This first crystal structure of a light-gated cation channel in the closed/ 
dark state at 2.3 A resolution provides insight into ChR dimerization, 
retinal binding and cation conductance. Moreover, owing to the large N 
domain unique to ChR, it has been difficult to align precisely the ChR 
sequence with other microbial rhodopsins (notably BR), and the present 
structure-based alignment (Supplementary Fig. 4) will assist in the 
design and interpretation of functional analyses, including electro- 
physiological and spectroscopic studies of ChR at the molecular level. 
The structural features around the ATR, Schiff base, and conduction 
pathway also provide insight into the blueshifted absorption spectrum 
of ChR (Amax = 470 nm), as compared to that of BR (A max = 568 nm). 
In general, the maximum absorption wavelength of retinal proteins is 


Figure 5 | Two constriction sites on the cytoplasmic side of C1C2 in the 
closed state. a, The first constriction site is formed by Ser 102 (63), Glu 129 (90) 
and Asn 297 (258). Hydrogen bonds are shown as black dashed lines. The blue 
dashed line represents a possible proton transfer pathway. b, The second 
constriction, made by Tyr 109. The cavity formed by TM1, 2 and 7 is occluded 
by Tyr 109, and the cavity formed by TM2, 3 and 7 is occluded by Glu 122 and 
His 173. 


determined by the energy difference between ground (So) and excited 
(S;) states, and this gap is mainly affected by the planarity of the 
conjugated system of the retinal chromophore, the distance between 
the protonated Schiffbase and its counterion, and the interaction of the 
chromophore with polar or polarizable residues*’. Although the 
planarity of the ATR is unchanged between ChR and BR, the 
counterion of ChR, Asp 292, is located ~1 A closer to the Schiff base 
than the corresponding Asp residue of BR (Fig. 3a, b), and the 
negatively charged residues are aligned along the conducting pathway 
(Fig. 4a, b). These environments are likely to stabilize the Sp state of 
ChR, thus enlarging the energy gap between the Sp and Sj states and 
thereby causing the relative absorption spectrum blueshift. 

Much about the photocycle remains unknown but is thought to be 
similar to that of BR****”°, in which the essential early event is the 
dipole change of the protonated Schiff base, which alters the nitrogen 
pK, by several orders of magnitude. In the case of ChRs, this may 
cause the release of the Schiff base proton to Asp 292 (probably not to 
either Glu 162 or water because Asp 292 is closer than these other two 
moieties, which also may explain why the D292A mutant is inactive; 
Fig. 3). The protonation of Asp 292 is likely to repel Lys 132 (93), as 
with Arg 82 in BR, and this movement may enlarge pore diameter and 
help cations to pass (Figs 3a and 4b). It is also thought that channel 
opening may be coupled with reprotonation of the Schiff base. Given 
the calculated pK, (Supplementary Table 2) and the distance from the 
Schiff base nitrogen atom, we suggest two candidates for this proton 
donor—Glu 122 and Glu 129 (Supplementary Fig. 15)—but further 
studies, including structural determinations of photocycle intermediate 
states, will be required to refine our model. 

In recent years, many strategies have been applied to create ChR 
variants with improved properties for optogenetics, ranging from 
designer ChR variants based on functional and structural similarities 
between BR and ChR (E123X, H134R, C128X, 1131V, D156A, T159C, 
C1V1)%*-1719°, to chimaera construction along with mutagenesis 
(ChRGR, L132C, ChD, ChEF, C1V1)'*”? (Fig. 6a). These approaches 
have generated a number of ChR variants with useful properties, but the 
high-resolution crystal structure is a prerequisite for the design of ChR 
variants with ideal properties. The present crystal structure describes 
the environment around the retinal-binding pocket (Fig. 6a, b), which 
will enable optimized design of red- and blueshifted ChR variants. In 
addition, structure of the cation-conducting pathway may facilitate 
construction of ChR variants with improved photocurrents, photo- 
sensitivity, cation selectivity and kinetics. For example, K132A and 
Q95A show strong photocurrents and K™ selectivity (Supplementary 
Figs 9 and 12), which could be useful to suppress neural activity. 


Figure 6 | Distribution of known mutations and possible candidates for 
future mutations. a, Mutations that affect both the absorption spectrum and 
the kinetics (Cys 167 (128), Glu 162 (123) and Asp 195 (156); deep red), the 
conductance (Thr 198 (159); light green), the selectivity (Leu 171 (132); dark 
blue) and the kinetics (Glu 122 (83), Ile 170 (131) and His 173 (134); dark cyan) 
of ChR2. b, Polar (Glu 162 (123), Thr 166 (127), Cys 167 (128), Asp 195 (156), 
Thr 198 (159) and Ser 295 (256); magenta) and non-polar (Ile 170 (131), Ie 199 
(160), Gly 202 (163), Leu 221 (182) and Gly 263 (224); orange) residues 
surrounding ATR. 
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To understand the photocycle in more detail, further structural 
studies, including determination of crystal structures in intermediate 
states, are clearly needed. However, the present structural informa- 
tion represents a key step in enabling the principled design of 
ChR variants with new properties, and will accelerate both applica- 
tions of optogenetics to intact-systems biology, and basic mechanistic 
understanding of these remarkable photoreceptor proteins. 


METHODS SUMMARY 


C1C2 was cloned into cleavable enhanced green fluorescent protein (EGFP)-Hisg 
fusion pFastBacl vector. The fusion protein was expressed in insect cells, solubilized 
in 2.5% (w/v) n-dodecyl-f-p-maltoside (DDM) and 0.5% (w/v) cholesteryl 
hemisuccinate, and purified by nickel affinity chromatography. After that, the 
C-terminal EGFP was cleaved by His-tagged tobacco etch virus (TEV) protease 
(homemade). Then the sample mixture was passed through Ni-NTA resin again to 
remove the cleaved His-tagged EGFP and TEV protease. The sample was further 
purified by size-exclusion chromatography. Crystals were grown in a lipidic cubic 
phase using monoolein. Diffraction data were measured at beamline X06SA of the 
Swiss Light Source and at beamline BL32XU of SPring-8. The structure was solved 
by the MAD method using mercury derivative. Data collection and refinement 
statistics are presented in Supplementary Table 1. Electrophysiological recordings 
were conducted using patch-clamp on HEK293 cells expressing the wild-type and 
mutant C1C2. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Expression and purification of C1C2. Chimaeras between ChR1 and ChR2 from 
C. reinhardtii and other algal species were subcloned into the pCGFP-EU vector”! 
for expression in HEK293 cells. The tobacco etch virus (TEV) protease cleavage 
site, the coding sequence of enhanced GFP (EGFP), and the octa-histidine tag 
(EGFP-Hisg) were introduced at the C terminus of the chimaeric constructs. All 
constructs were screened by FSEC analysis’. The gene encoding the best 
chimaera (C1C2) was subcloned into the modified pFastBacl vector for expres- 
sion in Sf+ insect cells. Baculovirus-infected Sf+ cells were cultured in Sf900II 
(Invitrogen) at 27°C for 24h, and then the temperature was reduced to 20°C. 
Cells were harvested 72h after infection by centrifugation at 6,000g for 10 min. 
The pellets were disrupted by two passages through a microfluidizer at 15,000 
pounds per square inch, and were resuspended in a buffer containing 300 mM 
NaCl, 50 mM Tris-HCl, pH 8.0, 5% glycerol and 0.1 mM phenylmethylsulfonyl 
fluoride (PMSF). The cell debris was cleared by centrifugation at 10,000g for 
40 min, and the crude membrane fraction was collected by ultracentrifugation 
(Ti45 rotor, 43,000 r.p.m., 1h). This fraction was solubilized in a buffer contain- 
ing 300mM NaCl, 50mM Tris-HCl, pH 8.0, 5% glycerol, 20mM imidazole, 
0.1mM PMSF, 2.5% n-dodecyl-f-b-maltoside (DDM) and 0.5% cholesteryl 
hemisuccinate (CHS). The insoluble material was removed by ultracentrifugation 
(Ti70 rotor, 45,000 r.p.m., 30 min), and the supernatant was mixed with Ni-NTA 
resin (QIAGEN). After binding for 1 h, C1C2 was eluted in buffer supplemented 
with 300 mM imidazole. Following the cleavage of EGFP-His, by His-tagged 
TEV protease (homemade), the sample was reloaded onto the Ni-NTA column 
to remove the cleaved EGFP-Hissg. The flow-through containing C1C2 was col- 
lected, concentrated, and further purified by size-exclusion chromatography in 
150 mM NaCl, 50 mM Tris-HCl, pH 8.0, 

5% glycerol, 0.05% DDM and 0.01% CHS. Peak fractions were pooled and 
concentrated to ~10mgml ' for crystallization. For the mercury derivative, 
the concentrated protein was incubated with a sixfold molar excess of methyl 
mercury chloride at 20°C for 1h. The derivative was ultracentrifuged and used 
for crystallization experiments. 

Crystallization. C1C2 was mixed with monoolein (Sigma) in a 2:3 protein to lipid 
ratio (w/w). Aliquots (100 nl) of the protein-LCP mixture were spotted on a 96-well 
sandwich plate and overlaid by 1 il of precipitant solution by the crystallization 
robot, mosquito LCP (TTP LabTech). Native crystals were obtained in 30-34% 
(w/v) PEGSOODME, 100 mM Na citrate, pH 6.0, 100 mM MgCl, 100 mM NaCl 
and 100 mM (NH4)2SOg, whereas the derivative crystals were grown in 31% (w/v) 
PEGS500DME, 100 mM HEPES-NaOH, pH 7.0, 200 mM Li,SO, and 10 mM ATP. 
All crystals were incubated for 2-3 weeks in the dark. They were harvested using 
micromounts (MiTeGen), and were flash-cooled in liquid nitrogen without any 
additional cryoprotectant. 

Structure determination. X-ray diffraction data sets for the native and mercury- 
derivatized protein crystals were collected on beamline X06SA at SLS and 
beamline BL32XU at SPring-8, using a 1-j1m-wide, 15-jum-high microbeam™. 
Data were indexed and scaled with the programs XDS* and SCALA”, or with 
DENZO and SCALEPACK from the HKL2000 program suite (HKL Research). 
Experimental phases were determined by the MAD method, using the four Hg 
sites identified with the program SHELX"'. Subsequent refinements of the heavy 
atom parameters and phase calculations were performed with the program 
SHARP”. The data collection and phasing statistics are shown in Supplemen- 
tary Table 1. The initial model structure of C1C2 was built with the program 
Phyre”, using the Anabaena sensory rhodopsin structure (PDB accession 1XIO) 
as the template. The resultant structure was manually modified to fit into the 
experimental electron density maps, using the program Coot. The structure was 


then refined with the program Phenix*. Figures were prepared with Cuemol 
(http://www.cuemolL.org). 

Electrophysiology. HEK293 cells were cultured on poly-lysine-coated, glass- 
bottom culture dishes (Matsunami), and were transfected with 0.5 yg of a plasmid 
construct containing the GFP-tagged C1C2 or the GFP-tagged C1C2 mutants. At 
24-30h after transfection, the cells were placed in a bath medium, containing 
140 mM NaCl, 1 mM CaCl, 2 mM MgCl, 10 mM HEPES and 5 mM glucose (pH 
7.4 with NaOH), under an inverted microscope (Olympus IX71). Calcium and 
potassium photocurrents were measured by replacing 140 mM NaCl by 90 mM 
CaCl, or 140 mM KCl accordingly. For proton photocurrents, cell bath was 5 mM 
NaCl, 135 mM N-methyl-p-glucamine, 1mM CaCl, 2mM MgCl, 10mM 
HEPES and 5mM glucose (pH 6.4). A borosilicate patch pipette (Harvard 
Apparatus), with a resistance of about 5-8 MQ, was filled with 140 mM KCl, 
5mM EGTA, 2mM MgCl, and 10mM HEPES (pH 7.2 with KOH). C1C2 
currents were recorded in the voltage-clamp mode and in the whole-cell con- 
figuration. The cells were held at a membrane potential of —80 mV, and were 
depolarized by 10 mV voltage steps of 1.8s up to +70 mV. The light-dependent 
currents were activated 200 ms after the depolarization step, with 465 nm light 
(1.5 mW mm ”) for 1,000 ms, elicited by a high power LED illumination system 
(LEX2-B, Brainvision) connected to an A/D converter (Digidata 1440, Axon 
CNS, Molecular Devices), controlled by the pClamp10 software (Axon CNS). 
The light power was 1.5 mW mm 7. Currents were measured using an Axopatch 
200B amplifier (Axon CNS, Molecular Devices), filtered at 2 KHz, and sampled at 
5 KHz, using a Digidata 1440A digitizer (Axon CNS) controlled by the pClamp10 
software (Axon CNS). 

Fluorescence measurements. The cells were transfected with 0.5 1g wild-type 
C1C2 or C1C2 mutants, using Fugene 6, for 30 h. The cells were then washed with 
PBS and fixed with 4% paraformaldehyde in PBS for 20 min at room temperature 
(20 °C), and washed again with PBS before microscopy observation. GFP fluor- 
escence was observed with a laser confocal microscope (FV1000 Olympus). To 
estimate membrane expression of C1C2 and its mutants, the ratio between the 
membrane fluorescence and cytosolic were determined. 

Ultraviolet/visible spectroscory. Ultraviolet/visible absorption spectra were 
recorded with an Ultrospec 3300 pro ultraviolet/visible spectrophotometer 
(Amersham Biosciences) by use of 1-cm quartz cuvettes. Freshly prepared 
C1C2 was used for the measuments. pH was adjusted by addition of 100 volumes 
of buffer solution yielding final concentrations of 50mM Na citrate, pH 4.0, 
50mM Na acetate, pH 5.0, 50mM Na cacodylate, pH 6.0, 50 mM HEPES, pH 
7.0,50 mM Tris, pH 8.0 and 9.0, or 50 mM CAPS, pH 10.0, plus 150 mM NaCl, 5% 
glycerol, 0.05% DDM and 0.01% CHS. 
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Flu transmission work is urgent 


Yoshihiro Kawaoka explains that research on transmissible avian flu 
viruses needs to continue if pandemics are to be prevented. 


ighly pathogenic avian H5N1 
H influenza viruses first proved lethal 

in humans in 1997 in Hong Kong. 
Since 2003, 578 confirmed infections have 
resulted in 340 deaths (go.nature.com/ 
epb7ts). Now widespread in parts of south- 
east Asia and the Middle East, H5N1 viruses 
have killed or led to the culling of hundreds 
of millions of birds. 

To date, H5N1 viruses have not been 
transmitted between humans. Some experts 
have argued that it is impossible. But given 
the potential consequences of a global out- 
break, it is crucial to know whether these 
viruses can ever become transmissible. Work 
by my group (accepted by Nature) and an 
independent study (accepted by Science) 
led by Ron Fouchier of the Erasmus Medi- 
cal Center in Rotterdam, the Netherlands, 
suggest that H5N1 viruses have the potential 
to spread between mammals. As the risks of 
such research and its publication are debated 
by the community, I argue that we should 
pursue transmission studies of highly patho- 
genic avian influenza viruses with urgency. 

To determine whether H5N1 viruses 
could be transmitted between humans, 
my team generated viruses that combined 
the H5 haemagglutinin (HA) gene with 
the remaining genes from a pandemic 
2009 HIN1 influenza virus. Avian H5N1 
and human pandemic 2009 viruses readily 
exchange genes in experimental settings, 
and those from a human virus may facilitate 
replication in mammals. Indeed, we identi- 
fied a mutant H5 HA/2009 virus that spread 
between infected and uninfected ferrets 
(used as models to study the transmission of 
influenza in mammals) in separate cages via 
respiratory droplets in the air. Thus viruses 
possessing an H5 HA protein can transmit 
between mammals. 

Our results also show that not all transmis- 
sible HS HA-possessing viruses are lethal. In 
ferrets, our mutant H5 HA/2009 virus was 
no more pathogenic than the pandemic 2009 
virus — it did not kill any of the infected ani- 
mals. And, importantly, current vaccines and 
antiviral compounds are effective against it. 

Fouchier and his team also generated a 
transmissible HS HA-possessing virus — 
meaning that two independent studies have 
demonstrated the potential for transmissi- 
bility of HS HA-possessing viruses between 
ferrets. Their mutant H5 HA virus, gener- 
ated in the genetic background of an H5N1 
virus, did kill infected ferrets. 

Some people have argued that the risks 


of such studies — misuse and accidental 
release, for example — outweigh the bene- 
fits. I counter that H5N1 viruses circulating 
in nature already pose a threat, because 
influenza viruses mutate constantly and can 
cause pandemics with great losses of life. 
Within the past century, ‘Spanish’ influenza, 
which stemmed from a virus of avian ori- 
gin, killed between 20 million and 50 million 
people. Because H5N1 mutations that confer 
transmissibility in mammals may emerge in 
nature, I believe that it would be irresponsi- 
ble not to study the underlying mechanisms. 


H5N1 avian influenza virus particles. 


The new work has implications for pan- 
demic preparedness. There is an urgent need 
to expand development, production and 
distribution of vaccines against H5 viruses, 
and to stockpile antiviral compounds. 
Both studies identify specific mutations in 
HA that confer transmissibility in ferrets 
to H5 HA-possessing viruses. A subset of 
these mutations has been detected in HSN1 
viruses circulating in certain countries. It is 
therefore imperative that these viruses are 
monitored closely so that eradication efforts 
and countermeasures (such as vaccine-strain 
selection) can be focused on them, should 
they acquire transmissibility. 

Consequently, I believe that the benefits 
of these studies — the knowledge that H5 
HA-possessing viruses pose a risk and 
the ability to monitor them and develop 
countermeasures — outweigh the risks. 
High biosafety and security standards can 
be met. Our experiments were carried out 
in a high-containment facility by a small 
group of highly trained individuals who 
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operate under strict procedures to prevent 
the accidental release of viruses. 

However, the US National Science Advi- 
sory Board for Biosecurity (NSABB) has 
recommended that details of both studies 
(including the mutations that confer trans- 
missibility) should be restricted, and released 
only to select individuals on a ‘need-to-know’ 
basis. I acknowledge the advisory role of the 
NSABB, but I do not concur with its decision. 

The primary justification for the NSABB’s 
recommendation is that publication of our 
data “could enable replication of the experi- 
ments by those who would seek to do harm” 
(go.nature.com/nywkdy). But redacting our 
papers will not eliminate that possibility — 
there is already enough information publicly 
available to allow someone to make a trans- 
missible HS HA-possessing virus. 

The mechanism that the US government 
proposes for releasing data would also be 
unwieldy. Thousands of applications to 
access the research are likely to be filed, 
and potential background checks would 
create a huge administrative burden. We 
cannot afford to lose time if we are to combat 
emerging pandemic threats. Even if an effi- 
cient process can be established, it would be 
difficult to enforce continued confidentiality 
in the scientific community. 

By contrast, wide data dissemination will 
attract researchers from other areas to con- 
tribute to the field. This is crucial, because 
new ideas are needed to answer some of the 
most urgent questions. For example, the 
specific mutations that we identified suggest 
that influenza transmission is more complex 
than anticipated and involves not only the 
receptor-binding properties of HA, but other 
biological and physical properties. 

The redaction of our manuscript, intended 
to contain risk, will make it harder for legiti- 
mate scientists to get this information while 
failing to provide a barrier to those who 
would do harm. To find better solutions to 
dual-use concerns, the international com- 
munity should convene to discuss how to 
minimize risk while supporting scientific dis- 
covery. Flu investigators (including me) have 
agreed to a 60-day moratorium on avian flu 
transmission research (go.nature.com/ttivj5) 
because of the current controversy. But our 
work remains urgent — we cannot give up. m 
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